Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2348 |
Symbol | |
ID | 4808982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2800406 |
End bp | 2803516 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107755 |
Product | Ig-related protein |
Protein accession | YP_001038743 |
Protein GI | 125974833 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0040576 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATC TCAAAAAAGT GCTGGCAGTG TTAGTTGTAA TCTCAGTGAT TTCAACTCTC TTAGTGCCTG CATTTGCTGA TTCATTCAGC TATGAAAAAG AAGCAGAAAT TTTGTATAGG CTCGGCTTAT ACAAAGGAAC ATCAGAAACA GAGTATGTTC CAAACTTGGA AGGTAAACTT GACAGACAGA CCGGAGTAGT TATGCTCCTC AGACTGTTCG GTCAGGAAGA CGATGCATTG GAAATTCCAA TGGATGAGGC AGCTCAGACA CTTGCTGCTA AATTCAAAGA TGCAGCTGAC ATTGCAGACT GGGCACAAAG ACAAGTGGCT TATGCAGTTG AAAAGGGATA TGTAAAAGGT TATCCGGACG GAACATTCCT TCCGAACGCA GACCTCAACG GCTTGGCTTT CTGCTCATTG ATTCTTCAGC AGTTGGGATA TGACGGAGAC TTTGTTTTCG ATGAAGCTGC GTACAAGTTG CAAGAGTTTG GCGGCTTGAC TGCAGAACAA GCTGAAGCGT TCAACAACAA GAACGGAATC AACAGAGACT CAATGGTTGG TATTGCTTTC TCAGCTTTGC AGGCTGTATA CAAAGCTACA GGAAAGACAG TTATTGAGGT TCTCGTAGAG AACGGAAATG TTTCCAAAGA ACTTGCTATA GAACTCGGTG TTCTTTTGAA AGCCATCAAG GAAGTAAAAG CTTTGGATGC TGTTAAGGTT CAGGTTGGAA AAGAACCTGT ACTTCCTGAA GAAGTAGAAG TAGTATATGA AGATGACACA ACTGAAAAAC TTGCAGTTGA ATGGCCTACA GTTGATACTT CAGAAGTTGG TGAACAGGAA ATCGAAGGTA CTATCAAAGG TGCCAGCGGT TTGGCTTACA GAGAACCAAA GGCTACTCTC AAGGTTATAG TAACACCTGA AGAACTCCAA GTTGTAGATG TTAAGGCTCC TAACCTTAAG GAAATTGTAA TTGAATTCAA CGGAGAAGTA GCTTCAAAAG CTGATGAAAA ATCCAGCTAC TCAGTTGAAG ACAATACTAT TGAATTGGTT ACAGTATCAG AAGACAAGAC TACAGTTACA TTGACAGTTG CTGGTGCTAT GACAGCTGAA GAAGAAATCG AAGTAACAAT TAAAACAGCA ACTGGCTTGA AGGAAGAAGT TACTAAGACT GTAGTACCTG CTGACTACGA AAATCCGGAA GCTGAATCCA TTGCTTTGAT AGGTCCGAAC TCCTTTGAAA TTAAATTCTC AGAACCTGTT CAGAGCAGCT CAGATGCAGA AGTTCTCGTT AATGACGGAA CTTATTATGT AAGTGAAGAA AAACTGTCAC AGGACTACAG AACATTAACT GTAGAACTGG GCGTAAGTTC ATTGAATGAA GGAACTTACA AAGTAAAAGT TAAAGGTTAC AGAGACTATG CTGGAAACAT AATGAGAACA AAGACCTTTG ACTTGGAGTA TGTAAAAGAT ACAACTCCTC CAACTGCTAA AGTAAAAGAA GCAACACAGA ACAAAGTAGT AATTGAATTC AATGAGCCTG CTACAAGAGA TGGTTACTCT GGTGATGAAG CAGCTCTTAC AAGAGATTAC TTCTATCAGA CATATTCTTC CTGGAAGCCA ACTAAAGTTG TAGCTTCAGA CAATAACAAA GTTTATACTT TGTACTTCTC TGAAGACCAG AACGATGGTG GCTATCCTGT ATATCTGCTT CCGGTAGGAA ACGTTACTAT AACAATCCTC AAGGAAGTAG ATGATGACGC TGTAGTAGAT GCATGGGGCA ACAAGCTCGA GTCCGATCTT AAACTTACTG CTACAGTAGC AGCCGATAAT GAGGCTCCAA CAGTAAAAAG TGTAACAGCT GAAGCAGAAG ATAAAATCGT GGTTGTATTC AGCGAAGATG TAAACGAGAA CCAGGCAAAA GATAAGGACA ACTATGTAAT TAAGAAAGAC GGAAAAGAAA TAGACACAGC TATTTCAAGC ATCACATATG ACAGCAATGA AACAAAGGTA ACAATTGTTT TGGATGAAAA GCTTAGTGGT GGAAAATATA CAATAGATAT TAAGGGTATT AAAGATACTT CAGTATCTGA AAACGAAATG AAAGCAGTTA CTATTGAATT TGAAGTAACT GACAAGACAG CTCCGACAAT TGAAGAAGTA ACATTTGTTG ACAACTACAT CTATGTAAGA TACAGCGAAG CTATGTCAAC AAAGGGCAAC GGTTCAGTAC TGAACAAGGA CAACTACAAA CTCGTAGATG ACAACGATAA GAAAGTAGAA ATTAAGAAAA TTGAATTGTT TGGCTCTGAC AAGAACAAAG TAAGAATAAC TGTTGATAGT GATGTAGATC TTAACGTAGA TTACGAACTC ACAATTGCTA ACGTTGAGGA TGAAGCTGGT AATGCTATAA GTGCATTCGA TGTTAAGGCA AAGAAATTGA GTGAAGAGCA AGCACCAGAA GTATCAGAAA TTAGAATTAT CAGCAAGACT GAAATTGAAA TAGTAATTAA CAAGATCCTT GACAAGGCAA CTGTTGAAAA GACTGACTTT GAAGTAGAAA GAGGCAGCAA CAAAGTTGCA CTCACAAGAA TAAGCTCAAT CACCTATGAT GATGGTAAAA CAATAGTTAA GGGTGTACTC CCGGATGCAG TACGTCCTGC TAACTCCGGA GACATCACAG GTTATACGCT CTACATTGTG GGTGAAATTA AATCCGATAC AGGTAAGGAA ATGGCAACAG GAGCAGTTTC GAAGCCAGTT GATGATAAGT TTGCACCAAG CTTTGTAAGC GTAGCTAATG GTGTATACGG CGATGCATCA AAGAAAGGAT TCACATTGAC ATTCGATGAA GACATCAAGT TCTTGAACAA CTCAGCTGGT TTGGGTGCAA CCGACCTCGT AATCAAGAAC GGTAGCAAGA CTCTTGAAGC TGGTATCGAC TATGATGTAG CAGCTATCGA TAACAAGATA ACAGTTACAC TCAAAGGTGA CGACTATGCT GACTTCACAG GAACTCTTAA AGTTTCAACT AAGGATACTG TGAAGTATAT CACAGACGAG GCTGGTAATG CACTCAACAA GTTCGAAAAT AAAGAGGTAA AGGTTCAATA A
|
Protein sequence | MKNLKKVLAV LVVISVISTL LVPAFADSFS YEKEAEILYR LGLYKGTSET EYVPNLEGKL DRQTGVVMLL RLFGQEDDAL EIPMDEAAQT LAAKFKDAAD IADWAQRQVA YAVEKGYVKG YPDGTFLPNA DLNGLAFCSL ILQQLGYDGD FVFDEAAYKL QEFGGLTAEQ AEAFNNKNGI NRDSMVGIAF SALQAVYKAT GKTVIEVLVE NGNVSKELAI ELGVLLKAIK EVKALDAVKV QVGKEPVLPE EVEVVYEDDT TEKLAVEWPT VDTSEVGEQE IEGTIKGASG LAYREPKATL KVIVTPEELQ VVDVKAPNLK EIVIEFNGEV ASKADEKSSY SVEDNTIELV TVSEDKTTVT LTVAGAMTAE EEIEVTIKTA TGLKEEVTKT VVPADYENPE AESIALIGPN SFEIKFSEPV QSSSDAEVLV NDGTYYVSEE KLSQDYRTLT VELGVSSLNE GTYKVKVKGY RDYAGNIMRT KTFDLEYVKD TTPPTAKVKE ATQNKVVIEF NEPATRDGYS GDEAALTRDY FYQTYSSWKP TKVVASDNNK VYTLYFSEDQ NDGGYPVYLL PVGNVTITIL KEVDDDAVVD AWGNKLESDL KLTATVAADN EAPTVKSVTA EAEDKIVVVF SEDVNENQAK DKDNYVIKKD GKEIDTAISS ITYDSNETKV TIVLDEKLSG GKYTIDIKGI KDTSVSENEM KAVTIEFEVT DKTAPTIEEV TFVDNYIYVR YSEAMSTKGN GSVLNKDNYK LVDDNDKKVE IKKIELFGSD KNKVRITVDS DVDLNVDYEL TIANVEDEAG NAISAFDVKA KKLSEEQAPE VSEIRIISKT EIEIVINKIL DKATVEKTDF EVERGSNKVA LTRISSITYD DGKTIVKGVL PDAVRPANSG DITGYTLYIV GEIKSDTGKE MATGAVSKPV DDKFAPSFVS VANGVYGDAS KKGFTLTFDE DIKFLNNSAG LGATDLVIKN GSKTLEAGID YDVAAIDNKI TVTLKGDDYA DFTGTLKVST KDTVKYITDE AGNALNKFEN KEVKVQ
|
| |