Gene Cthe_2476 details


Gene Information

Locus tag: Cthe_2476
Symbol:
ID: 4809856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2948505
End bp: 2949914
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 36%
IMG OID: 640107891
Product: SPP1 family phage head morphogenesis protein
Protein accession: YP_001038871
Protein GI: 125974961
COG category: [T] Signal transduction mechanisms
COG ID: [COG5585] NAD+--asparagine ADP-ribosyltransferase
TIGRFAM ID: [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family
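
The start, end, gene length and protein length fields can be checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 2948505
END_BP = 2949914


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1410 469
```

Under these assumptions the result agrees with the listed 1410 bp and 469 aa.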


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATAAAA AGGACATAAC CTACTGGGAA AAACGACAGG AACGGAAATA TCTGGCCGGA 
GAGAAGAAGC TTGATGAATA TTATAAAGGT TTGCAGAAAG CGTTTAGACA AGCAAAACGA
GAAATCCAGA GTGTTATAAA TGATTTCTAC ATGCGATATG CAAAAGAAAA CAAAGTATCC
TATGCTGAAG CCCAAAAACT ACTTGATAAG GCAGAAATAG GCGAGCTGCA GGACTTTATA
GACCTTGTTA ATAAGAATAT GGGCAAGTAT AATCGAAAGC TTAACAATAT GTCTATAAAA
GCCAGAATTA CCCGCTATCA AGCGCTAGAA AAGCAGATAG ATGCTATACT ACAGCAATTA
TATGCTATTG AGTATGAGTA TAAAGGTAAA GAGCTACTGA AGGAAGTATA TGAGGATTCT
TATTATCGTA CCTGGTTTAA CATAGACCAG TACCACGGCT TTCATCAGGA GTTCGCACAG
ATTAATCCTA GAACTATAGA AGAGTTGATA AAATATCCTT GGAATGGAGC AAGTTTTTCT
GATAGGATAT GGAAGCAAAA AGACCATATG CTGCAGGTAT TAAAAGAAGA CATTACTACT
ATGTTAATAC AAGGGAAAAA TCCTCAAACA TTAGCAAGAG ATTTCGCAAG AAGGTTTAAA
ACAAAAGAAT ATGAAGCATA TAGGCTGCTA CATACAGAGA GCAGTTTTAT TATCGAACAG
GGAACTTTAG CAGCATATAA AGAAGATGGG GTGGAGAAGT ATCAGATTCT GGCTACTCTG
GACATGAGGA CATCGGATAT ATGCAGAAGT GAGGATGGGA AAATATATGA TGTGGATGAG
GCGACAGTGG GAGTAAATTA TCCTCCATAT CATCCATTTT GTAGGACCAC AACAGTGCCA
TATTATGAGG ATGCTGAGGT AGGTACAAGG GTTGCGCGTG ATCCGGTAAC AGGTAGAAGT
TATGAAGTTC CAGCGAATAT GACATATGAG CAATGGAAAA ATAGATATAT AGATCAACCT
GACAATATTA TTCGCCAAGA GATACTGAGT AATCCTGAAA GACTTGATAA TTATAGTATC
CAACATTATA ATAAGCATAA AGAAGGAACC AAACAATATG AGCAGTATAA GCAATCAAGA
CTTAAAAAAG GTCAAACTGA ACAAAGCAGT TTACTAATTT CTTACGATGA AGCTAAAGAA
ATAATAAAAA AATATGCTGG TACTGGAGTA TTTAGTAGAG ACAGGAAAGG GAAATGGAGA
AATGAGGAAT TTGTGGATGT AGATTCTATA ATTGGTGTTG TGCATAATAT TGATGGGACA
GTAACGCCTA CTAATAGAAT TCAAATAAAA TATGGGAAGA ACAGCGTGCA CATTGTACCT
GTATTACCAA GAAAGGAGAG AAATAAATGA
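
The gene sequence is printed in blocks of ten bases, so the spaces need to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names) strip the formatting and compute a GC percentage. The GC content listed in the record refers to the full gene; the example uses only the first line of the sequence, so its value need not match that figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGAATAAAA AGGACATAAC CTACTGGGAA AAACGACAGG AACGGAAATA TCTGGCCGGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To check the listed GC content, pass the whole gene sequence through `clean_sequence` instead of the first line.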
 
Protein sequence
MNKKDITYWE KRQERKYLAG EKKLDEYYKG LQKAFRQAKR EIQSVINDFY MRYAKENKVS 
YAEAQKLLDK AEIGELQDFI DLVNKNMGKY NRKLNNMSIK ARITRYQALE KQIDAILQQL
YAIEYEYKGK ELLKEVYEDS YYRTWFNIDQ YHGFHQEFAQ INPRTIEELI KYPWNGASFS
DRIWKQKDHM LQVLKEDITT MLIQGKNPQT LARDFARRFK TKEYEAYRLL HTESSFIIEQ
GTLAAYKEDG VEKYQILATL DMRTSDICRS EDGKIYDVDE ATVGVNYPPY HPFCRTTTVP
YYEDAEVGTR VARDPVTGRS YEVPANMTYE QWKNRYIDQP DNIIRQEILS NPERLDNYSI
QHYNKHKEGT KQYEQYKQSR LKKGQTEQSS LLISYDEAKE IIKKYAGTGV FSRDRKGKWR
NEEFVDVDSI IGVVHNIDGT VTPTNRIQIK YGKNSVHIVP VLPRKERNK
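
The gene begins with GTG while the protein begins with M. Under translation table 11, GTG can act as an initiator and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It builds the codon table from the usual compact TCAG-ordered amino-acid string (table 11 assigns the same amino acids as the standard code), and it treats only ATG, GTG and TTG as start codons, which is a simplifying assumption rather than the full initiator list of table 11. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a reduced set of initiator codons, for illustration only.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in ASSUMED_START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("GTGAATAAAA AGGACATAAC CTACTGGGAA"))  # MNKKDITYWE
```

The output matches the first ten residues of the protein sequence above. Passing the complete gene sequence to `translate` would allow the same comparison over the full length.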