Gene Cthe_1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1586 
Symbol 
ID4809577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1916130 
End bp1917785 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content38% 
IMG OID640107004 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001038005 
Protein GI125974095 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCG CAAAAAAGAA ATTGAACATA TGGCTGGTAT TGGCATTGGC AATACTAGCT 
TTGTTTCTTA TTTTTGTAGT GTATCCGATA GGTCTTATTC TATATAAAAG TATTTTGATG
GAAGATGGCA CAATAAGTTT TTCTTACTTT ACGAAATTCT TCTCCAAAAA GTTTTATTGG
AGTACGCTGG TTAACAGTTT TAAGGTTACC ATAGCTTCAA CTTTGGTATC AGCTGTTTTG
GGACTAATTA TGGCTTATTT GTTAAGAAGC ATAAAGATTA AAGGCAGCAA GTATTTAAAC
ATTCTTATTG TTATATCTTA TTTGTCACCT CCGTTTATAG GAGCCTATGC ATGGGTACAG
CTTCTTGGAA GAAACGGAGT AATAACAAGA ATTTTAAATT CTGTTTTCAA TGCTAAACTG
GGCGGTATTT ATGGATTTGC AGGTATTGTT CTTGTTTTCT CATTGCAGTC TTTCCCATTG
GTTTATATGT ATGTTTCAGG CGCATTGAAA AACTTGGATA ATTCCTTGAA TGAAGCGGCA
GAAAGCCTTG GATGTTCAAC TGTTCAAAGG GTTTGGAAAG TTATTGTGCC GTTGATAACT
CCAACGTTGC TGGCCAGTTC ACTGTTGGTT TTCATGCGTG TCTTCTCAGA CTTTGGTACG
CCGATGCTTA TCGGTGAGGG TTACAAGACA TTCCCGGTAC TATTGTATAG CCAGTTTATG
GGTGAGGTAA GTACCGACGC CCACTATGCA GCAGCCCTTT GTGTTATTGT AATTGTAATT
ACATTGGTTT TGTTTTTCTT GCAGAAGTAC ATTGGAAATC GTCTGACCTA CTCTATGTCA
GCGTTAAAAC CGATGGAGCC GCAAAAAGTG ACCGGGATTC GCAATGTTCT TGCCCACGGC
TTTGTATATT TAGTTGTATT GGCTGCAATT TTACCGCAGT TAACAGTAAT TACCACATCC
TTCCTGGAAA CAAGAGGTGC TTCATATACC GGTCAATTTA CTTTGCAAAA TTATAAAAAT
ATTATAATGC CGAAAAATAT CAGTACAATA ACCAACACGT ATCTATTTGG TTTGGCTGCA
ATAATATTGG TTGTAATTCT GGGAGTATTG ATATCTTACT TAACGGTAAG AAAGAGGTCT
TTCTTAACCT CCATACTGGA TACATTGACA ATGTTCCCAT ATATTATACC CGGTTCCGTA
TTGGGTATTT CATTCTTGTA TGCGTTTAAC AAAAAGCCGC TGTTACTTAG CGGAACAGCA
ATCATAGTTA TTATTTCATT GTGTATCAGG CGTATGCCTT ATACAATCCG TTCCAGTACG
GCTATAATAG GACAGATAAG TCCAAGTGTT GAAGAAGCTG CAATCAGCCT GGGATCTACA
GAGGTAAAGA CATTTGTAAA GATAACTGTA CCGATGATGA TGGCCGGTGT ATTATCCGGT
GCAATTATGA GCTGGATTAC CCTGATTAGT GAGCTGAGTT CCTCCATTAT TCTATATACA
AGTAAAACTC AGACTTTGAC TGTGGCAATT TATACGGAAG TCATTCGCAG TAACTTTGGA
AATGCGGCTG CCTATTCAAC GGTATTGACA ATTACCAGTG TTTTATCCTT GCTGTTGTTC
TTTAAGGTAT CGGGAAGTGA GGATATAAGT GTATAG
 
Protein sequence
MNTAKKKLNI WLVLALAILA LFLIFVVYPI GLILYKSILM EDGTISFSYF TKFFSKKFYW 
STLVNSFKVT IASTLVSAVL GLIMAYLLRS IKIKGSKYLN ILIVISYLSP PFIGAYAWVQ
LLGRNGVITR ILNSVFNAKL GGIYGFAGIV LVFSLQSFPL VYMYVSGALK NLDNSLNEAA
ESLGCSTVQR VWKVIVPLIT PTLLASSLLV FMRVFSDFGT PMLIGEGYKT FPVLLYSQFM
GEVSTDAHYA AALCVIVIVI TLVLFFLQKY IGNRLTYSMS ALKPMEPQKV TGIRNVLAHG
FVYLVVLAAI LPQLTVITTS FLETRGASYT GQFTLQNYKN IIMPKNISTI TNTYLFGLAA
IILVVILGVL ISYLTVRKRS FLTSILDTLT MFPYIIPGSV LGISFLYAFN KKPLLLSGTA
IIVIISLCIR RMPYTIRSST AIIGQISPSV EEAAISLGST EVKTFVKITV PMMMAGVLSG
AIMSWITLIS ELSSSIILYT SKTQTLTVAI YTEVIRSNFG NAAAYSTVLT ITSVLSLLLF
FKVSGSEDIS V