Gene Cthe_2612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2612 
Symbol 
ID4809034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3087105 
End bp3090941 
Gene Length3837 bp 
Protein Length1278 aa 
Translation table11 
GC content42% 
IMG OID640108026 
Productfibronectin, type III 
Protein accessionYP_001039005 
Protein GI125975095 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGA AAAGAGCGGG AAGAGTTGTC GGATTAATAT TGGCAGTAAG TCTCGTTTTG 
CAATTTAATA TAATAAATAC GGTTTGGGCT GTCGACCCGG AGCCTCAATC CCCACCGTCA
AAGCTTAGGA TTGAGCCGCA AAGTCCGGAT GAGCCTGCCA TTGGCTATAA TGAATTTGAC
AAATATTATG TTGACCTGAA ATGGGACGTA AGTTTTCCTT CCTTTGCGAT TTCCAAATAT
CTTAACATCT ACACGCAGGA AATCCCCAAG TCTTACAGAA TAGCAAAACC TCGCAGTGTG
AAAGCAAAGG ATGTTTCCGG AAATTCGAAC TCATACCGCC TGAAGGAGCT CAATTCAGGT
ACAATTTACT ATATTGATGC GACTGCCTCT TACACGTATG TCGAGGACAG CAAGCTATAC
AGAAGTGCGG AATCGGCTGC TTCAAACAGG GTGAAGGTCT TGACCGAGAT TGATATCAGC
GCATATGCAG TTTCCACAAA CAAGATTAAA ATAGAATGGG ATGATGTGTG GAATACTGAC
GGAAGAATAG GTTACAAGCT TTATATTTCC GAAAACGGCA GTTTTGCCAA TACGCCGCCG
ATATACATAG GAAAAGACCA GATAGGCCCG GACAAGCCGG TAAAAGTTAA TGAATCAACG
GGAAAGCTGG AGTATATTCA TACCGCAAGA GACCCGGGAA GGGTTTATTA TATCAGAATA
GAACCGGACG TAAATGATGC GGAGCTTAAG AAAAACCAGT ACAGCAAAAC CGTTATTGTG
AGTAGTTACA TTTTGGTCAG GACAACAAAA ATAGCTTCAA CGGAATCCGG GGTTATTTGG
AAGCTGGAAT GGAGTCCGGT TGTTACCAGC CTGAGTGACA GCAATGTAAA AGTCAGCTAC
CAGATTTACA GAGGTCAAAT AGATTCCACC GATCTGGCCC AGTATATGGC CTCTGTGGAC
GGCACCGAGT TTTTTGTCAC GCTTCCGCCC GGTGAGGTTG AACATTATTT TATAATAAGG
GCTATTGTAA CCAAAGACGG GCTTGACGTG TATGAGGGCA TAAGGATAGA ATCCGAACGG
ATAATAGTAA GGGAGCATGA AACACCTTCT TATCCTGCGG CTCCCGTACT TGTGGACAAA
TTTGAAAAGT CTCCGGGAGA AACGATTATA AGCTATGATG AGGAATTAAA ACCCAATAGT
GCAACAATTT TGTGGAAAGT TCCCACCCGC GGAGACGGGC AGATAGACAC CGATATTATG
TATGACATAT GGCTTGTGGA CGATCCGAAC CTTATTGACA ATCCGCCGGA GGGCAGAAAA
ATTGCTTCAA ACATATCAAT GGGAAGCAGC AACTATGTAA TAAGCGGAGA TACGGTTATA
GGTTATAAAT ATGTTGTTTC AAATTTGACT CCCAATTCCA CTTATTATTT TAAAATAGTG
GCCAAAAAAC AGTTTATTGA ATATGTTGAC GACATACTTC AGAACGTGGA ATATGTGTCC
GACCCTGCTG TAAAAGTGAT TATCACTCCG GCAGGAGAGC CGATAAACCA GCCCAATGTG
CCGGCAAAGC CGCCGCTTAA AGTAAAAAAG GATTTAAACG GTCAGTACAT GGTTACCGAA
AGCACAGTGA CCATACAGCT TAAAAACCTG TGGTATGAGA TATTTAATTT TGAGGAAAAC
AAATGGGAAT ACATACGGAC TGAGAAACTT CATTATGATG ATGTGCCGCC CTTTGACCCG
TTGACATCCG TTGTTGATGA TGTTTATTAC AGAAAAGTGA CCTATGATTC CGGTGTAAGA
ATAAATGTCG GATGCGTTGA ATATAGTGAA GGCATGTCAT ATGAAGAGCT TTACTATCTT
CCCGCTGACA AAGTGGTAGA TTTCCCTGTT GACCCGAATG ATCCGTGGGA AAACCCGGAT
TTAAATCCTG ACGGGAAGAA ACACAATGTG GATATTACAA TTACCGATCT TAAGCCCAAT
ACGGTGTATG TCATATGGGT AAGGGCCGCA AGACCCAGTG CGGACCTTGT ATCCGAACCG
TCGGATCCGA TAGTAATTAC AACAAACCCT GTTATAGAAC CTCCTTTGGA AAAGCCGGTA
GTGCCTTCCT TCAACTATCA TTCGGCGGGA GACACGTACA TTGATTTGGG CTGGGAATTT
ACCCCGGGAC ATTATTACTA TTTAAAATAC GGTCTTGAAG ACAATATAAA TACAGCAACG
GGAAATATAA AAGTTACGCC GGAAGATTTG GAAAATTCCG TATATACCAG AATAACGGAC
TTGACTCCCA ATACCCTGTA CTATTTCTGG ATACAGGCGG AAGCCGTCGG CAAAAACGGA
GAGACAATAA GGTCTGAGTG GAGTGATTCG TATCTCGTAA GGACCCTTGC ATATATTCCG
CCGGACACGC CAAAGGGATT CGGTATAAAA AACAGCATTG ACGCAATTAC AAAAAACACT
ATTACGTATG AGTGGATGCA GGAAGAAAAC CTCGAATATA TCCTTGAGCT TGCCGACAAC
GTAGACTACG AGGATGCTGT GGAATACAAG GTTGGCATGG TCTCGGAATT TACTGTGGGA
GGGCTTTTGT CAAACCACAG GTATTATGCA AGACTGTATT CCTATGACCC CGTGAAGAAT
TTGAGGTCCA ATCCCACCCA AAGCGTGGTT GTGAGGACTA AAAGAAGCAG TGATGACTAT
GATTCGGACG AGGATGTTGA CAATGTCATA ATCGGAGATT TTGTAAAAAA GGAAAAAACC
GTTAAGGACG GTGTATGGGA AGTCAGAATA GTCGGAGTTG ATGCAGACAG ATTTGTGGAT
TATGTAATTA GAGACAACAA GCTGGATATA ACCGTAAAAC TTGATGATCC GCCCCAGTCT
TATAAAAAAC TGAGAATACT GGTTTCCGAC AAGGTGTTTA AATCTCTGAC CGAGCTTTCG
GAAAATCTTA CTTTCAAAAT GAAAGATTTT TCCCTTGTCA TAAGACCTGG AACGATAACG
ACGGCAAACT TCAACCCTCT GGCAGGGAAG GCTTCGGGAG TGGATTATGA GATATGTATT
ACCCATCTTG GAACTTTCGG AACCAATGTA AAGAACATGA TATTTAAAAC CGAGACGATA
AAAATAGAAT TGGGCATAGT TGAAGGCGGT AATGTAACAC CTGTAAATTC CGTCCTAAAA
CCGCTTAAAG TTCTTTCGGA ATATGATGAT ACCGACAGAT ATACTCAAGG GAAAACATCC
GGATTTTTAT ACGACAGTGA GATTGGAAAG TGGAAAAGGC TTAATACCGC TTATGACTTC
AATTATGACA GAAATACAGG AACCCTGGCT TTTGAAACCG TTAAACTGGG CGCCACTGCG
GTGGCGGAAC TGGATAAAGA CTTTTTTGAC GACATATATT ATCATCCTTA TGAGACCAGC
ATAAATAATG TGGCATCTGT GCATGAACTA AAAAGCATTT CCACAAGGCT TTTTGAACCG
GACAACTACG CATCTTTGGG CGACACGGTA AAGTTTATGT TTGATGTGCT GGACTATGAG
TATGGAAGTG ATTTTATGAA CAAGGCTTTG AAAGCCGGAT TTATCACTTC CGCTGATATT
AAGGCTTCAA ACAGAAACTG CACCGCGGAA GATGCTTACA AGATGATAAT CAGGCTGTTT
GAGCTAAAAA CCGGAAAACT TTTGGATGCG AAAACCAAGT CGAAGTTTAT TGAAGAAAAT
GGATTTAAAT TGGTAAGAGA TGCCGGTAAA ACAGTTATGG CAAATGAGCC GATAAAAAGA
CATGAGGTTC TTGTGCTCAT TGAAAAATTG TTGGTATATA TCGGAGAACT GGAATAA
 
Protein sequence
MKMKRAGRVV GLILAVSLVL QFNIINTVWA VDPEPQSPPS KLRIEPQSPD EPAIGYNEFD 
KYYVDLKWDV SFPSFAISKY LNIYTQEIPK SYRIAKPRSV KAKDVSGNSN SYRLKELNSG
TIYYIDATAS YTYVEDSKLY RSAESAASNR VKVLTEIDIS AYAVSTNKIK IEWDDVWNTD
GRIGYKLYIS ENGSFANTPP IYIGKDQIGP DKPVKVNEST GKLEYIHTAR DPGRVYYIRI
EPDVNDAELK KNQYSKTVIV SSYILVRTTK IASTESGVIW KLEWSPVVTS LSDSNVKVSY
QIYRGQIDST DLAQYMASVD GTEFFVTLPP GEVEHYFIIR AIVTKDGLDV YEGIRIESER
IIVREHETPS YPAAPVLVDK FEKSPGETII SYDEELKPNS ATILWKVPTR GDGQIDTDIM
YDIWLVDDPN LIDNPPEGRK IASNISMGSS NYVISGDTVI GYKYVVSNLT PNSTYYFKIV
AKKQFIEYVD DILQNVEYVS DPAVKVIITP AGEPINQPNV PAKPPLKVKK DLNGQYMVTE
STVTIQLKNL WYEIFNFEEN KWEYIRTEKL HYDDVPPFDP LTSVVDDVYY RKVTYDSGVR
INVGCVEYSE GMSYEELYYL PADKVVDFPV DPNDPWENPD LNPDGKKHNV DITITDLKPN
TVYVIWVRAA RPSADLVSEP SDPIVITTNP VIEPPLEKPV VPSFNYHSAG DTYIDLGWEF
TPGHYYYLKY GLEDNINTAT GNIKVTPEDL ENSVYTRITD LTPNTLYYFW IQAEAVGKNG
ETIRSEWSDS YLVRTLAYIP PDTPKGFGIK NSIDAITKNT ITYEWMQEEN LEYILELADN
VDYEDAVEYK VGMVSEFTVG GLLSNHRYYA RLYSYDPVKN LRSNPTQSVV VRTKRSSDDY
DSDEDVDNVI IGDFVKKEKT VKDGVWEVRI VGVDADRFVD YVIRDNKLDI TVKLDDPPQS
YKKLRILVSD KVFKSLTELS ENLTFKMKDF SLVIRPGTIT TANFNPLAGK ASGVDYEICI
THLGTFGTNV KNMIFKTETI KIELGIVEGG NVTPVNSVLK PLKVLSEYDD TDRYTQGKTS
GFLYDSEIGK WKRLNTAYDF NYDRNTGTLA FETVKLGATA VAELDKDFFD DIYYHPYETS
INNVASVHEL KSISTRLFEP DNYASLGDTV KFMFDVLDYE YGSDFMNKAL KAGFITSADI
KASNRNCTAE DAYKMIIRLF ELKTGKLLDA KTKSKFIEEN GFKLVRDAGK TVMANEPIKR
HEVLVLIEKL LVYIGELE