Gene Information |
Locus tag | Teth514_1254 |
Symbol | |
ID | 5877029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1294819 |
End bp | 1296018 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641541604 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001662884 |
Protein GI | 167039899 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
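The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive (the record does not state this, but it matches the reported 1200 bp), and that the unspliced CDS ends in one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1294819, 1296018)
print(length, protein_length_aa(length))  # prints: 1200 399
```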
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TTCTTGAACT GCGTGAAAAA CGCGCAAAAG CATGGGAAGC AGCAAAGGCA TTTCTTGATT CAAAGCGTGG TAGTGATGGG CTTGTATCCG CAGAAGATGC AGCAACCTAT GACAAAATGG AAGAAGACAT TATTAATCTC GGTAAGGAAA TAGCAAGATT GGAACGTCAA GAGGCTCTTG AAGCAGAGCT TAATAAGCCA GTAAATATGC CTCTTACTGG AAAGCCAGCT GTTCCAGGGA TGGATGCAAA GACCGGAAGA GCCAGTGATG AATATAGGAA AGCATTCTGG AACGTAATGC GTAGCAAAAA CCCTCGTCAT GATGTGTTAA ACGCCTTATC TGTAGGCACT GATTCTGAGG GAGGATACCT TGTTCCTGAT GAATTTGAGC GCACCTTGGT TCAAACTCTT GAGGAAGAGA ATGTATTCCG TAAACTTGCA AAGATTATTC AAACTTCAAG TGGTGATCGT AAAATCCCGG TTGTGGTGAC CAAAGGCACA GCTGCTTGGC TTGACGAAGG TGAGGAGTTT GATGAGAGTG ATTCTGTATT CGGTCAGACA TCTATTGGTG CTTACAAGCT GGGTACAATG ATTAAAGTTT CTGATGAACT TCTCAATGAC AGTGTATTTG ATCTGGAGAA TTATATCTCC ACTGAATTTG CCCGTAGAAT CGGTGCTAAG GAAGAAGAAG CTTTTTTAGT TGGAGACGGA GATGGAAAAC CTACTGGTAT TTTCAACGCA ACAGGCGGAG CACAGCTTGG AGTGACAGCA GGGTCTGCAA CTGCTATTAC TGCAGATGAG ATTATCGATC TTGTTTACTC ATTAAAAGCG CCATATAGAA AGAACGCGGT ATTCCTGATG AATGATGCAA CAGTAAAGGC AATCCGTAAG CTGAAAGACG GTCAAGGTCA ATATCTGTGG CAGCCTTCTT TAACAGCAGG TACTCCAGAT ACTTTATTAA ATCGTCCGGT TTATACTTCA GCTTATGCTC CTACTATTGA AGCTGGAGCT AAAACTATTG CCTTCGGTGA TTTCGGATAT TATTGGATTG CCGATAGACA GGGACGTTCT TTCAAACGTT TAAACGAGCT TTTTGCAACC ACAGGGCAGG TTGGTTTCCT TGCGAGCCAG CGTGTAGATG GAAAGCTTAT CTTACCTGAA GCCATCAAAG TTCTTCAGCA GAAGGCTTAA
|
Protein sequence | MSKILELREK RAKAWEAAKA FLDSKRGSDG LVSAEDAATY DKMEEDIINL GKEIARLERQ EALEAELNKP VNMPLTGKPA VPGMDAKTGR ASDEYRKAFW NVMRSKNPRH DVLNALSVGT DSEGGYLVPD EFERTLVQTL EEENVFRKLA KIIQTSSGDR KIPVVVTKGT AAWLDEGEEF DESDSVFGQT SIGAYKLGTM IKVSDELLND SVFDLENYIS TEFARRIGAK EEEAFLVGDG DGKPTGIFNA TGGAQLGVTA GSATAITADE IIDLVYSLKA PYRKNAVFLM NDATVKAIRK LKDGQGQYLW QPSLTAGTPD TLLNRPVYTS AYAPTIEAGA KTIAFGDFGY YWIADRQGRS FKRLNELFAT TGQVGFLASQ RVDGKLILPE AIKVLQQKA
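The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it strips the 10-character grouping spaces, computes the GC fraction, and translates with the amino acid assignments of NCBI translation table 11 (which match the standard code). Alternative start codons are not handled specially, which is harmless here because the gene starts with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace (such as the 10-character grouping) and uppercase."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon; unknown codons become X."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in the record above.
prefix = "ATGAGTAAAA TTCTTGAACT GCGTGAAAAA"
print(translate(prefix))  # prints: MSKILELREK
```

Passing the full Gene sequence value to `translate` and comparing the result with the Protein sequence value (after `clean_sequence`) gives an end-to-end consistency check. `gc_content` on the full sequence can be compared with the reported 43%, allowing for rounding.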
|
| |