Gene Cthe_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1823 
Symbol 
ID4809807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2157057 
End bp2158316 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content45% 
IMG OID640107237 
Productextracellular ligand-binding receptor 
Protein accessionYP_001038237 
Protein GI125974327 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.158541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGA AGTTTTGCAT ATTTGGCAAA AATACAATTA AGGTTTTGGC AATTGCACTG 
TCGGCACTGC TGGTATTCGC AGGCTGTTCC GGCAAAGTGG AGGAGCCGGT TGATAACAAA
CCCGGAACTG ATACTTCCGC AGAAGACACC ATAAAGGTGG GAATTCTTCA CTCCTTAAGC
GGAACCATGG CTATTAGCGA GGTATCCCTC AAAGATGCGG AATTGATGGC AATAGAAGAA
ATTAACCAGG CCGGAGGTCT GCTGGGCAAA AAAATTGAAC CGGTGATTGA AGACGGAGCT
TCCGATTGGC CTACTTTTGC AGAAAAGGCA AAGAAACTGC TCCAAAATGA CAAGGTTGCA
ACCGTTTTCG GATGCTGGAC TTCAGCCAGC CGTAAAGCCG TATTGCCGGT GTTTGAAGAA
AATAACGGAC TTTTGTGGTA TCCGGTGCAG TATGAGGGCA TGGAGTCATC ACCAAATATC
TTCTATACCG GTGCGGCACC CAATCAGCAG ATTGTTCCCG CAGTCGAATG GCTTTTGGAA
AACAAGGGAA AAAGATTTTT CCTCCTTGGC TCCGATTATG TATTTCCCAG AACCGCAAAC
AAAATTATCA AAGCTCAGCT AAGCGCCATA GGTGGGGAAC TTATTGCAGA GGAGTATACT
CCTTTGGGTC ATACCGATTA CAGTACCATT GTAAATAAAA TTAAAACCGC AAAACCGGAT
GTAGTGTTTA ACACCCTGAA CGGGGACAGC AATGTTGCCT TCTTCAAACA GCTCAAGGAT
GCGGGAATCA CGTCTGAAGA CATTACCGTT TGTTCTGTAA GTGTTGCAGA AGAAGAAATA
AGGGGTATAG GCGCTGAAAA TATAAAAGGT CACCTGGTTT CATGGAACTA TTACCAGACT
ACGGATACCC CGGAAAACAA AGAGTTTGTG GAAAAGTACA AATCTAAATA CGGAAGCGAC
AGGGTTACCG ATGATCCCAT AGAAGCGGCA TATATAGCAG TTCATTTGTG GGCTGAGGCA
GTTAAAAAGG CCGGTTCCTT TGAGGTGGAA AAGGTTAAGG AGGCAGCCAA AGGACTTGAA
TTTAAAGCTC CTGAAGGGCT TGTGAAAATT GAAGGAGAGA ACCAGCACCT GTGGAAGCCG
GTGAGGATTG GTGAGGTACA GGAAGACGGA CTTATCAAGG AAATCTGGAG TACAAGTGAA
GCCGTAAGGC CCGACCCATA CTTAAAAACC TACGACTGGG CAAAAGGCTT AAGCGATTAG
 
Protein sequence
MIKKFCIFGK NTIKVLAIAL SALLVFAGCS GKVEEPVDNK PGTDTSAEDT IKVGILHSLS 
GTMAISEVSL KDAELMAIEE INQAGGLLGK KIEPVIEDGA SDWPTFAEKA KKLLQNDKVA
TVFGCWTSAS RKAVLPVFEE NNGLLWYPVQ YEGMESSPNI FYTGAAPNQQ IVPAVEWLLE
NKGKRFFLLG SDYVFPRTAN KIIKAQLSAI GGELIAEEYT PLGHTDYSTI VNKIKTAKPD
VVFNTLNGDS NVAFFKQLKD AGITSEDITV CSVSVAEEEI RGIGAENIKG HLVSWNYYQT
TDTPENKEFV EKYKSKYGSD RVTDDPIEAA YIAVHLWAEA VKKAGSFEVE KVKEAAKGLE
FKAPEGLVKI EGENQHLWKP VRIGEVQEDG LIKEIWSTSE AVRPDPYLKT YDWAKGLSD