Gene Information |
Locus tag | Cthe_2612 |
Symbol | |
ID | 4809034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3087105 |
End bp | 3090941 |
Gene Length | 3837 bp |
Protein Length | 1278 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108026 |
Product | fibronectin, type III |
Protein accession | YP_001039005 |
Protein GI | 125975095 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
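The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Cthe_2612.
length = gene_length_bp(3087105, 3090941)
print(length, expected_protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported in the record.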
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGA AAAGAGCGGG AAGAGTTGTC GGATTAATAT TGGCAGTAAG TCTCGTTTTG CAATTTAATA TAATAAATAC GGTTTGGGCT GTCGACCCGG AGCCTCAATC CCCACCGTCA AAGCTTAGGA TTGAGCCGCA AAGTCCGGAT GAGCCTGCCA TTGGCTATAA TGAATTTGAC AAATATTATG TTGACCTGAA ATGGGACGTA AGTTTTCCTT CCTTTGCGAT TTCCAAATAT CTTAACATCT ACACGCAGGA AATCCCCAAG TCTTACAGAA TAGCAAAACC TCGCAGTGTG AAAGCAAAGG ATGTTTCCGG AAATTCGAAC TCATACCGCC TGAAGGAGCT CAATTCAGGT ACAATTTACT ATATTGATGC GACTGCCTCT TACACGTATG TCGAGGACAG CAAGCTATAC AGAAGTGCGG AATCGGCTGC TTCAAACAGG GTGAAGGTCT TGACCGAGAT TGATATCAGC GCATATGCAG TTTCCACAAA CAAGATTAAA ATAGAATGGG ATGATGTGTG GAATACTGAC GGAAGAATAG GTTACAAGCT TTATATTTCC GAAAACGGCA GTTTTGCCAA TACGCCGCCG ATATACATAG GAAAAGACCA GATAGGCCCG GACAAGCCGG TAAAAGTTAA TGAATCAACG GGAAAGCTGG AGTATATTCA TACCGCAAGA GACCCGGGAA GGGTTTATTA TATCAGAATA GAACCGGACG TAAATGATGC GGAGCTTAAG AAAAACCAGT ACAGCAAAAC CGTTATTGTG AGTAGTTACA TTTTGGTCAG GACAACAAAA ATAGCTTCAA CGGAATCCGG GGTTATTTGG AAGCTGGAAT GGAGTCCGGT TGTTACCAGC CTGAGTGACA GCAATGTAAA AGTCAGCTAC CAGATTTACA GAGGTCAAAT AGATTCCACC GATCTGGCCC AGTATATGGC CTCTGTGGAC GGCACCGAGT TTTTTGTCAC GCTTCCGCCC GGTGAGGTTG AACATTATTT TATAATAAGG GCTATTGTAA CCAAAGACGG GCTTGACGTG TATGAGGGCA TAAGGATAGA ATCCGAACGG ATAATAGTAA GGGAGCATGA AACACCTTCT TATCCTGCGG CTCCCGTACT TGTGGACAAA TTTGAAAAGT CTCCGGGAGA AACGATTATA AGCTATGATG AGGAATTAAA ACCCAATAGT GCAACAATTT TGTGGAAAGT TCCCACCCGC GGAGACGGGC AGATAGACAC CGATATTATG TATGACATAT GGCTTGTGGA CGATCCGAAC CTTATTGACA ATCCGCCGGA GGGCAGAAAA ATTGCTTCAA ACATATCAAT GGGAAGCAGC AACTATGTAA TAAGCGGAGA TACGGTTATA GGTTATAAAT ATGTTGTTTC AAATTTGACT CCCAATTCCA CTTATTATTT TAAAATAGTG GCCAAAAAAC AGTTTATTGA ATATGTTGAC GACATACTTC AGAACGTGGA ATATGTGTCC GACCCTGCTG TAAAAGTGAT TATCACTCCG GCAGGAGAGC CGATAAACCA GCCCAATGTG CCGGCAAAGC CGCCGCTTAA AGTAAAAAAG GATTTAAACG GTCAGTACAT GGTTACCGAA AGCACAGTGA CCATACAGCT TAAAAACCTG TGGTATGAGA TATTTAATTT TGAGGAAAAC AAATGGGAAT ACATACGGAC TGAGAAACTT CATTATGATG ATGTGCCGCC CTTTGACCCG TTGACATCCG TTGTTGATGA TGTTTATTAC AGAAAAGTGA CCTATGATTC CGGTGTAAGA 
ATAAATGTCG GATGCGTTGA ATATAGTGAA GGCATGTCAT ATGAAGAGCT TTACTATCTT CCCGCTGACA AAGTGGTAGA TTTCCCTGTT GACCCGAATG ATCCGTGGGA AAACCCGGAT TTAAATCCTG ACGGGAAGAA ACACAATGTG GATATTACAA TTACCGATCT TAAGCCCAAT ACGGTGTATG TCATATGGGT AAGGGCCGCA AGACCCAGTG CGGACCTTGT ATCCGAACCG TCGGATCCGA TAGTAATTAC AACAAACCCT GTTATAGAAC CTCCTTTGGA AAAGCCGGTA GTGCCTTCCT TCAACTATCA TTCGGCGGGA GACACGTACA TTGATTTGGG CTGGGAATTT ACCCCGGGAC ATTATTACTA TTTAAAATAC GGTCTTGAAG ACAATATAAA TACAGCAACG GGAAATATAA AAGTTACGCC GGAAGATTTG GAAAATTCCG TATATACCAG AATAACGGAC TTGACTCCCA ATACCCTGTA CTATTTCTGG ATACAGGCGG AAGCCGTCGG CAAAAACGGA GAGACAATAA GGTCTGAGTG GAGTGATTCG TATCTCGTAA GGACCCTTGC ATATATTCCG CCGGACACGC CAAAGGGATT CGGTATAAAA AACAGCATTG ACGCAATTAC AAAAAACACT ATTACGTATG AGTGGATGCA GGAAGAAAAC CTCGAATATA TCCTTGAGCT TGCCGACAAC GTAGACTACG AGGATGCTGT GGAATACAAG GTTGGCATGG TCTCGGAATT TACTGTGGGA GGGCTTTTGT CAAACCACAG GTATTATGCA AGACTGTATT CCTATGACCC CGTGAAGAAT TTGAGGTCCA ATCCCACCCA AAGCGTGGTT GTGAGGACTA AAAGAAGCAG TGATGACTAT GATTCGGACG AGGATGTTGA CAATGTCATA ATCGGAGATT TTGTAAAAAA GGAAAAAACC GTTAAGGACG GTGTATGGGA AGTCAGAATA GTCGGAGTTG ATGCAGACAG ATTTGTGGAT TATGTAATTA GAGACAACAA GCTGGATATA ACCGTAAAAC TTGATGATCC GCCCCAGTCT TATAAAAAAC TGAGAATACT GGTTTCCGAC AAGGTGTTTA AATCTCTGAC CGAGCTTTCG GAAAATCTTA CTTTCAAAAT GAAAGATTTT TCCCTTGTCA TAAGACCTGG AACGATAACG ACGGCAAACT TCAACCCTCT GGCAGGGAAG GCTTCGGGAG TGGATTATGA GATATGTATT ACCCATCTTG GAACTTTCGG AACCAATGTA AAGAACATGA TATTTAAAAC CGAGACGATA AAAATAGAAT TGGGCATAGT TGAAGGCGGT AATGTAACAC CTGTAAATTC CGTCCTAAAA CCGCTTAAAG TTCTTTCGGA ATATGATGAT ACCGACAGAT ATACTCAAGG GAAAACATCC GGATTTTTAT ACGACAGTGA GATTGGAAAG TGGAAAAGGC TTAATACCGC TTATGACTTC AATTATGACA GAAATACAGG AACCCTGGCT TTTGAAACCG TTAAACTGGG CGCCACTGCG GTGGCGGAAC TGGATAAAGA CTTTTTTGAC GACATATATT ATCATCCTTA TGAGACCAGC ATAAATAATG TGGCATCTGT GCATGAACTA AAAAGCATTT CCACAAGGCT TTTTGAACCG GACAACTACG CATCTTTGGG CGACACGGTA AAGTTTATGT TTGATGTGCT GGACTATGAG TATGGAAGTG ATTTTATGAA CAAGGCTTTG AAAGCCGGAT TTATCACTTC CGCTGATATT AAGGCTTCAA 
ACAGAAACTG CACCGCGGAA GATGCTTACA AGATGATAAT CAGGCTGTTT GAGCTAAAAA CCGGAAAACT TTTGGATGCG AAAACCAAGT CGAAGTTTAT TGAAGAAAAT GGATTTAAAT TGGTAAGAGA TGCCGGTAAA ACAGTTATGG CAAATGAGCC GATAAAAAGA CATGAGGTTC TTGTGCTCAT TGAAAAATTG TTGGTATATA TCGGAGAACT GGAATAA
|
Protein sequence | MKMKRAGRVV GLILAVSLVL QFNIINTVWA VDPEPQSPPS KLRIEPQSPD EPAIGYNEFD KYYVDLKWDV SFPSFAISKY LNIYTQEIPK SYRIAKPRSV KAKDVSGNSN SYRLKELNSG TIYYIDATAS YTYVEDSKLY RSAESAASNR VKVLTEIDIS AYAVSTNKIK IEWDDVWNTD GRIGYKLYIS ENGSFANTPP IYIGKDQIGP DKPVKVNEST GKLEYIHTAR DPGRVYYIRI EPDVNDAELK KNQYSKTVIV SSYILVRTTK IASTESGVIW KLEWSPVVTS LSDSNVKVSY QIYRGQIDST DLAQYMASVD GTEFFVTLPP GEVEHYFIIR AIVTKDGLDV YEGIRIESER IIVREHETPS YPAAPVLVDK FEKSPGETII SYDEELKPNS ATILWKVPTR GDGQIDTDIM YDIWLVDDPN LIDNPPEGRK IASNISMGSS NYVISGDTVI GYKYVVSNLT PNSTYYFKIV AKKQFIEYVD DILQNVEYVS DPAVKVIITP AGEPINQPNV PAKPPLKVKK DLNGQYMVTE STVTIQLKNL WYEIFNFEEN KWEYIRTEKL HYDDVPPFDP LTSVVDDVYY RKVTYDSGVR INVGCVEYSE GMSYEELYYL PADKVVDFPV DPNDPWENPD LNPDGKKHNV DITITDLKPN TVYVIWVRAA RPSADLVSEP SDPIVITTNP VIEPPLEKPV VPSFNYHSAG DTYIDLGWEF TPGHYYYLKY GLEDNINTAT GNIKVTPEDL ENSVYTRITD LTPNTLYYFW IQAEAVGKNG ETIRSEWSDS YLVRTLAYIP PDTPKGFGIK NSIDAITKNT ITYEWMQEEN LEYILELADN VDYEDAVEYK VGMVSEFTVG GLLSNHRYYA RLYSYDPVKN LRSNPTQSVV VRTKRSSDDY DSDEDVDNVI IGDFVKKEKT VKDGVWEVRI VGVDADRFVD YVIRDNKLDI TVKLDDPPQS YKKLRILVSD KVFKSLTELS ENLTFKMKDF SLVIRPGTIT TANFNPLAGK ASGVDYEICI THLGTFGTNV KNMIFKTETI KIELGIVEGG NVTPVNSVLK PLKVLSEYDD TDRYTQGKTS GFLYDSEIGK WKRLNTAYDF NYDRNTGTLA FETVKLGATA VAELDKDFFD DIYYHPYETS INNVASVHEL KSISTRLFEP DNYASLGDTV KFMFDVLDYE YGSDFMNKAL KAGFITSADI KASNRNCTAE DAYKMIIRLF ELKTGKLLDA KTKSKFIEEN GFKLVRDAGK TVMANEPIKR HEVLVLIEKL LVYIGELE
|
| |
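The relationship between the gene sequence and the protein sequence above can be illustrated by translating the nucleotides codon by codon. This is a minimal sketch, not the database's own method: it assumes the sequence is read in frame from its first base, and it relies on the fact that NCBI translation table 11 assigns amino acids to codons the same way the standard code does (the tables differ in permitted start codons). The `translate` helper and `CODON_TABLE` name are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGAAAATGA AAAGAGCGGG A"))
print(translate("GGAGAACTGG AATAA"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) would be one way to check the two fields against each other.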