Gene Cthe_2348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2348 
Symbol 
ID4808982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2800406 
End bp2803516 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content40% 
IMG OID640107755 
ProductIg-related protein 
Protein accessionYP_001038743 
Protein GI125974833 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0040576 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATC TCAAAAAAGT GCTGGCAGTG TTAGTTGTAA TCTCAGTGAT TTCAACTCTC 
TTAGTGCCTG CATTTGCTGA TTCATTCAGC TATGAAAAAG AAGCAGAAAT TTTGTATAGG
CTCGGCTTAT ACAAAGGAAC ATCAGAAACA GAGTATGTTC CAAACTTGGA AGGTAAACTT
GACAGACAGA CCGGAGTAGT TATGCTCCTC AGACTGTTCG GTCAGGAAGA CGATGCATTG
GAAATTCCAA TGGATGAGGC AGCTCAGACA CTTGCTGCTA AATTCAAAGA TGCAGCTGAC
ATTGCAGACT GGGCACAAAG ACAAGTGGCT TATGCAGTTG AAAAGGGATA TGTAAAAGGT
TATCCGGACG GAACATTCCT TCCGAACGCA GACCTCAACG GCTTGGCTTT CTGCTCATTG
ATTCTTCAGC AGTTGGGATA TGACGGAGAC TTTGTTTTCG ATGAAGCTGC GTACAAGTTG
CAAGAGTTTG GCGGCTTGAC TGCAGAACAA GCTGAAGCGT TCAACAACAA GAACGGAATC
AACAGAGACT CAATGGTTGG TATTGCTTTC TCAGCTTTGC AGGCTGTATA CAAAGCTACA
GGAAAGACAG TTATTGAGGT TCTCGTAGAG AACGGAAATG TTTCCAAAGA ACTTGCTATA
GAACTCGGTG TTCTTTTGAA AGCCATCAAG GAAGTAAAAG CTTTGGATGC TGTTAAGGTT
CAGGTTGGAA AAGAACCTGT ACTTCCTGAA GAAGTAGAAG TAGTATATGA AGATGACACA
ACTGAAAAAC TTGCAGTTGA ATGGCCTACA GTTGATACTT CAGAAGTTGG TGAACAGGAA
ATCGAAGGTA CTATCAAAGG TGCCAGCGGT TTGGCTTACA GAGAACCAAA GGCTACTCTC
AAGGTTATAG TAACACCTGA AGAACTCCAA GTTGTAGATG TTAAGGCTCC TAACCTTAAG
GAAATTGTAA TTGAATTCAA CGGAGAAGTA GCTTCAAAAG CTGATGAAAA ATCCAGCTAC
TCAGTTGAAG ACAATACTAT TGAATTGGTT ACAGTATCAG AAGACAAGAC TACAGTTACA
TTGACAGTTG CTGGTGCTAT GACAGCTGAA GAAGAAATCG AAGTAACAAT TAAAACAGCA
ACTGGCTTGA AGGAAGAAGT TACTAAGACT GTAGTACCTG CTGACTACGA AAATCCGGAA
GCTGAATCCA TTGCTTTGAT AGGTCCGAAC TCCTTTGAAA TTAAATTCTC AGAACCTGTT
CAGAGCAGCT CAGATGCAGA AGTTCTCGTT AATGACGGAA CTTATTATGT AAGTGAAGAA
AAACTGTCAC AGGACTACAG AACATTAACT GTAGAACTGG GCGTAAGTTC ATTGAATGAA
GGAACTTACA AAGTAAAAGT TAAAGGTTAC AGAGACTATG CTGGAAACAT AATGAGAACA
AAGACCTTTG ACTTGGAGTA TGTAAAAGAT ACAACTCCTC CAACTGCTAA AGTAAAAGAA
GCAACACAGA ACAAAGTAGT AATTGAATTC AATGAGCCTG CTACAAGAGA TGGTTACTCT
GGTGATGAAG CAGCTCTTAC AAGAGATTAC TTCTATCAGA CATATTCTTC CTGGAAGCCA
ACTAAAGTTG TAGCTTCAGA CAATAACAAA GTTTATACTT TGTACTTCTC TGAAGACCAG
AACGATGGTG GCTATCCTGT ATATCTGCTT CCGGTAGGAA ACGTTACTAT AACAATCCTC
AAGGAAGTAG ATGATGACGC TGTAGTAGAT GCATGGGGCA ACAAGCTCGA GTCCGATCTT
AAACTTACTG CTACAGTAGC AGCCGATAAT GAGGCTCCAA CAGTAAAAAG TGTAACAGCT
GAAGCAGAAG ATAAAATCGT GGTTGTATTC AGCGAAGATG TAAACGAGAA CCAGGCAAAA
GATAAGGACA ACTATGTAAT TAAGAAAGAC GGAAAAGAAA TAGACACAGC TATTTCAAGC
ATCACATATG ACAGCAATGA AACAAAGGTA ACAATTGTTT TGGATGAAAA GCTTAGTGGT
GGAAAATATA CAATAGATAT TAAGGGTATT AAAGATACTT CAGTATCTGA AAACGAAATG
AAAGCAGTTA CTATTGAATT TGAAGTAACT GACAAGACAG CTCCGACAAT TGAAGAAGTA
ACATTTGTTG ACAACTACAT CTATGTAAGA TACAGCGAAG CTATGTCAAC AAAGGGCAAC
GGTTCAGTAC TGAACAAGGA CAACTACAAA CTCGTAGATG ACAACGATAA GAAAGTAGAA
ATTAAGAAAA TTGAATTGTT TGGCTCTGAC AAGAACAAAG TAAGAATAAC TGTTGATAGT
GATGTAGATC TTAACGTAGA TTACGAACTC ACAATTGCTA ACGTTGAGGA TGAAGCTGGT
AATGCTATAA GTGCATTCGA TGTTAAGGCA AAGAAATTGA GTGAAGAGCA AGCACCAGAA
GTATCAGAAA TTAGAATTAT CAGCAAGACT GAAATTGAAA TAGTAATTAA CAAGATCCTT
GACAAGGCAA CTGTTGAAAA GACTGACTTT GAAGTAGAAA GAGGCAGCAA CAAAGTTGCA
CTCACAAGAA TAAGCTCAAT CACCTATGAT GATGGTAAAA CAATAGTTAA GGGTGTACTC
CCGGATGCAG TACGTCCTGC TAACTCCGGA GACATCACAG GTTATACGCT CTACATTGTG
GGTGAAATTA AATCCGATAC AGGTAAGGAA ATGGCAACAG GAGCAGTTTC GAAGCCAGTT
GATGATAAGT TTGCACCAAG CTTTGTAAGC GTAGCTAATG GTGTATACGG CGATGCATCA
AAGAAAGGAT TCACATTGAC ATTCGATGAA GACATCAAGT TCTTGAACAA CTCAGCTGGT
TTGGGTGCAA CCGACCTCGT AATCAAGAAC GGTAGCAAGA CTCTTGAAGC TGGTATCGAC
TATGATGTAG CAGCTATCGA TAACAAGATA ACAGTTACAC TCAAAGGTGA CGACTATGCT
GACTTCACAG GAACTCTTAA AGTTTCAACT AAGGATACTG TGAAGTATAT CACAGACGAG
GCTGGTAATG CACTCAACAA GTTCGAAAAT AAAGAGGTAA AGGTTCAATA A
 
Protein sequence
MKNLKKVLAV LVVISVISTL LVPAFADSFS YEKEAEILYR LGLYKGTSET EYVPNLEGKL 
DRQTGVVMLL RLFGQEDDAL EIPMDEAAQT LAAKFKDAAD IADWAQRQVA YAVEKGYVKG
YPDGTFLPNA DLNGLAFCSL ILQQLGYDGD FVFDEAAYKL QEFGGLTAEQ AEAFNNKNGI
NRDSMVGIAF SALQAVYKAT GKTVIEVLVE NGNVSKELAI ELGVLLKAIK EVKALDAVKV
QVGKEPVLPE EVEVVYEDDT TEKLAVEWPT VDTSEVGEQE IEGTIKGASG LAYREPKATL
KVIVTPEELQ VVDVKAPNLK EIVIEFNGEV ASKADEKSSY SVEDNTIELV TVSEDKTTVT
LTVAGAMTAE EEIEVTIKTA TGLKEEVTKT VVPADYENPE AESIALIGPN SFEIKFSEPV
QSSSDAEVLV NDGTYYVSEE KLSQDYRTLT VELGVSSLNE GTYKVKVKGY RDYAGNIMRT
KTFDLEYVKD TTPPTAKVKE ATQNKVVIEF NEPATRDGYS GDEAALTRDY FYQTYSSWKP
TKVVASDNNK VYTLYFSEDQ NDGGYPVYLL PVGNVTITIL KEVDDDAVVD AWGNKLESDL
KLTATVAADN EAPTVKSVTA EAEDKIVVVF SEDVNENQAK DKDNYVIKKD GKEIDTAISS
ITYDSNETKV TIVLDEKLSG GKYTIDIKGI KDTSVSENEM KAVTIEFEVT DKTAPTIEEV
TFVDNYIYVR YSEAMSTKGN GSVLNKDNYK LVDDNDKKVE IKKIELFGSD KNKVRITVDS
DVDLNVDYEL TIANVEDEAG NAISAFDVKA KKLSEEQAPE VSEIRIISKT EIEIVINKIL
DKATVEKTDF EVERGSNKVA LTRISSITYD DGKTIVKGVL PDAVRPANSG DITGYTLYIV
GEIKSDTGKE MATGAVSKPV DDKFAPSFVS VANGVYGDAS KKGFTLTFDE DIKFLNNSAG
LGATDLVIKN GSKTLEAGID YDVAAIDNKI TVTLKGDDYA DFTGTLKVST KDTVKYITDE
AGNALNKFEN KEVKVQ