Gene Teth514_1254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1254 
Symbol 
ID5877029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1294819 
End bp1296018 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content43% 
IMG OID641541604 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001662884 
Protein GI167039899 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TTCTTGAACT GCGTGAAAAA CGCGCAAAAG CATGGGAAGC AGCAAAGGCA 
TTTCTTGATT CAAAGCGTGG TAGTGATGGG CTTGTATCCG CAGAAGATGC AGCAACCTAT
GACAAAATGG AAGAAGACAT TATTAATCTC GGTAAGGAAA TAGCAAGATT GGAACGTCAA
GAGGCTCTTG AAGCAGAGCT TAATAAGCCA GTAAATATGC CTCTTACTGG AAAGCCAGCT
GTTCCAGGGA TGGATGCAAA GACCGGAAGA GCCAGTGATG AATATAGGAA AGCATTCTGG
AACGTAATGC GTAGCAAAAA CCCTCGTCAT GATGTGTTAA ACGCCTTATC TGTAGGCACT
GATTCTGAGG GAGGATACCT TGTTCCTGAT GAATTTGAGC GCACCTTGGT TCAAACTCTT
GAGGAAGAGA ATGTATTCCG TAAACTTGCA AAGATTATTC AAACTTCAAG TGGTGATCGT
AAAATCCCGG TTGTGGTGAC CAAAGGCACA GCTGCTTGGC TTGACGAAGG TGAGGAGTTT
GATGAGAGTG ATTCTGTATT CGGTCAGACA TCTATTGGTG CTTACAAGCT GGGTACAATG
ATTAAAGTTT CTGATGAACT TCTCAATGAC AGTGTATTTG ATCTGGAGAA TTATATCTCC
ACTGAATTTG CCCGTAGAAT CGGTGCTAAG GAAGAAGAAG CTTTTTTAGT TGGAGACGGA
GATGGAAAAC CTACTGGTAT TTTCAACGCA ACAGGCGGAG CACAGCTTGG AGTGACAGCA
GGGTCTGCAA CTGCTATTAC TGCAGATGAG ATTATCGATC TTGTTTACTC ATTAAAAGCG
CCATATAGAA AGAACGCGGT ATTCCTGATG AATGATGCAA CAGTAAAGGC AATCCGTAAG
CTGAAAGACG GTCAAGGTCA ATATCTGTGG CAGCCTTCTT TAACAGCAGG TACTCCAGAT
ACTTTATTAA ATCGTCCGGT TTATACTTCA GCTTATGCTC CTACTATTGA AGCTGGAGCT
AAAACTATTG CCTTCGGTGA TTTCGGATAT TATTGGATTG CCGATAGACA GGGACGTTCT
TTCAAACGTT TAAACGAGCT TTTTGCAACC ACAGGGCAGG TTGGTTTCCT TGCGAGCCAG
CGTGTAGATG GAAAGCTTAT CTTACCTGAA GCCATCAAAG TTCTTCAGCA GAAGGCTTAA
 
Protein sequence
MSKILELREK RAKAWEAAKA FLDSKRGSDG LVSAEDAATY DKMEEDIINL GKEIARLERQ 
EALEAELNKP VNMPLTGKPA VPGMDAKTGR ASDEYRKAFW NVMRSKNPRH DVLNALSVGT
DSEGGYLVPD EFERTLVQTL EEENVFRKLA KIIQTSSGDR KIPVVVTKGT AAWLDEGEEF
DESDSVFGQT SIGAYKLGTM IKVSDELLND SVFDLENYIS TEFARRIGAK EEEAFLVGDG
DGKPTGIFNA TGGAQLGVTA GSATAITADE IIDLVYSLKA PYRKNAVFLM NDATVKAIRK
LKDGQGQYLW QPSLTAGTPD TLLNRPVYTS AYAPTIEAGA KTIAFGDFGY YWIADRQGRS
FKRLNELFAT TGQVGFLASQ RVDGKLILPE AIKVLQQKA