Gene Cthe_1630 details

Gene Information

Locus tag: Cthe_1630
Symbol:
ID: 4809325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1956610
End bp: 1957932
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 49%
IMG OID: 640107046
Product: HK97 family phage portal protein
Protein accession: YP_001038047
Protein GI: 125974137
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
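
The start, end, and length fields in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the stop codon counts toward the gene length but not the protein length. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1956610, 1957932)
print(length, protein_length(length))  # 1323 440
```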


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.506997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATAT TTTCCCGCTT GTTCAAAGCA AGGGACAAGC CGAAAAACAG CCTGTTCGGT 
AATGCATATA GCTTTTTCTT CGGCGGCACA TCCAGCGGAA AAGCTGTCAA TGAGCGGACT
GCTATGCAGA CAACTGCAGT GTATGCCTGT GTAAGGATAC TTGCAGAAGC CATCGCCGGG
CTTCCGCTTC ATGTATACCG ATACAAGGAA GACGGTGGCA AAGAAAAAGC GTTGACCCAC
CCGCTCTATT ATTTGCTCCA TGACGAACCA AACCCTGAGA TGACTTCATT TGTGTTCCGA
GAAACACTGA TGAGTCATCT TCTTTTATGG GGAAATGCTT ATGCCCAGAT TGTCAGGGAC
GGTTCCGGGC GAGTGCTGGC GCTTTATCCC CTTTTGCCAA ACAAAATGAC GGTAGACAGG
GCTCCAAACG GAGAGCTGTA TTACACTTAT CGGCGCGACA GCGATGAGAG CAGGGTTAAT
CCAAAAGCAG GCCTTATATA CCTACGAAGT GATGAGGTTC TTCACATCCC GGGACTCGGT
TTTGACGGAC TGATCGGATA CTCCCCTATT GCTATGGCCA AGAATGCCAT AGGCATGGCT
ATTGCCTGTG AGGAGTATGG TGCATCCTTT TTTGCCAACG GAGCAAATCC GGGTGGCGTT
CTGGAACATC CCGGCGTATT AAAGGATCCG GCAAAGGTGC GTGAAAGCTG GAACGCTGTT
TATCAAGGAA GTGCCAATGC TCACCGTATT GCAGTCCTGG AAGAGGGAAT GAAGTTTCAG
CCAATCGGCA TTCCACCCGA ACAGGCACAG TTTTTGGAGA CAAGAAAGTT CCAGATAAAC
GAAATTGCCC GGATATTCCG AGTACCTCCC CATATGGTTG GAGATCTTGA AAAGTCAAGC
TTTTCAAACA TCGAACAGCA ATCTCTGGAA TTTGTTAAAT ACACGCTTGA CCCGTGGGTG
GTGCGTTGGG AACAGGCTCT CCAAAAAGCG CTGCTTTTAC CATCAGAGAA GCGGGCATAC
TTTGTCAAAT TCAATGTAGA TGGCCTTCTG CGCGGTGATT ATGCAAGCCG CATGAATGGT
TATGCTGTAG CTCGCCAGAA CGGCTGGATG TCTGCTAACG ATATCCGCGA GCTTGAGGAC
ATGAACCGGA TTCCGGCGGA GTTGGGCGGA GATCTGTATC TTGTTAACGG TAACATGACC
AGGCTTGCCG ATGCAGGTAC ATTTGCAGGC AAAAACAATG CTGAAACGGA GGGATCAAAA
GTTGAACAAA TCACAAAAAC AAAAACCGGT TCGCCGCTTC TGGAACTGGA TACAAAACGA
TGA
 
Protein sequence
MRIFSRLFKA RDKPKNSLFG NAYSFFFGGT SSGKAVNERT AMQTTAVYAC VRILAEAIAG 
LPLHVYRYKE DGGKEKALTH PLYYLLHDEP NPEMTSFVFR ETLMSHLLLW GNAYAQIVRD
GSGRVLALYP LLPNKMTVDR APNGELYYTY RRDSDESRVN PKAGLIYLRS DEVLHIPGLG
FDGLIGYSPI AMAKNAIGMA IACEEYGASF FANGANPGGV LEHPGVLKDP AKVRESWNAV
YQGSANAHRI AVLEEGMKFQ PIGIPPEQAQ FLETRKFQIN EIARIFRVPP HMVGDLEKSS
FSNIEQQSLE FVKYTLDPWV VRWEQALQKA LLLPSEKRAY FVKFNVDGLL RGDYASRMNG
YAVARQNGWM SANDIRELED MNRIPAELGG DLYLVNGNMT RLADAGTFAG KNNAETEGSK
VEQITKTKTG SPLLELDTKR
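
The protein sequence can be derived from the gene sequence using the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in which codons may act as start codons). The sketch below is a hand-rolled illustration, not the pipeline that produced this record. It assumes the sequence is given on the coding strand, starts in frame, and begins with ATG, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI ordering: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAGAATAT TTTCCCGCTT GTTCAAAGCA"))  # MRIFSRLFKA
```

Passing the full 1323 bp gene sequence to `translate` should reproduce the 440-residue protein sequence listed above, since the final codon TGA is a stop.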