Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tter_2520 |
Symbol | |
ID | 8640549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobaculum terrenum ATCC BAA-798 |
Kingdom | Bacteria |
Replicon accession | NC_013526 |
Strand | + |
Start bp | 682702 |
End bp | 684354 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003324231 |
Protein GI | 269839539 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000303033 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTGA CGGAGCTGAG CTATGATCCT CAAGATCCGT TCTTTTCGAT CCCTGGCGAC GACCGTGGCA TGGCCTGGGG CGTGCAGATA TTTACCACCG GTAACCTCTA CGGGCTTGCA CCCGAGCACA CCGAGGTTGT AGCCGAGGGA GACGGGCTGC GTGTGAGCTG TAATCGCCTC AGCTGGGGTG GGCAGCAGGG GCGCAGCGAA GGACAGGTCG AGGCTGTGGT CAGCGTGCGT GGGCGCAGCG TGACGCTGCG CATCAGCGCC TGGCACGATG AGCCCGTGAA GGCGATCAAG CTACTGTTGC GTAACCTCCC CACAGAGGTC CAAAGAGAGG GACTCTGGCA CCCCACCCTC CCCCAGGGAG AGAGTCTCTG GCCTAGGTCA GGCTCTCCCG TTCTCTGGCG CTACCCCGGA CCGGAGTGGC TGACCCCGTG GGCATGCATT GGGGAGGCTC AAGGCGCGAT CTCCATCAGC GTCAGGGATC CTGAGGTCAG AGCTAAGCGC CTATACGCGC ATGTGCCCCC ATACCTACAC CTTCCAGAAG TGGAGGTCAT CTGTGAAGAG AGCGCACTCA GGTGGTGTTC CCACTTCACT AGTCCTGAGA TCAAGATAAG CTTCTTGAGT GGTTTGGCCG ATCTGGAGGC CGATTTTAGG GCTCACCTTG AGTTCCTTGA GGGCTCGTAT GGGTTGCGTA GGTGGGAGGA CAGGCAGGAC GTGCCCGGCT GGGCGAGGGA CATAAAGCTG GTGGTTTACC TGCATGGACA ACATTGGACG GGTCATGTAT TCAATACGTT TGACAAGATG GCCTATGTCC TGGAACAGCT CACCAGGCTT ATTGCTGGTC GCCATGTACT GTGCTTTGTA CCGGGATGGT CTGGGCGGTA CTACTTCTCG TATCCTCTGT ACGCTCCGGG TGAGTCGTTG GGAGGAAGCG GCGCCTTTGA GCGTTTTATA TCCGTAGCGC ACAGGTTGGG GGTCAAGGTG ATGCCGATGT TCGGAGCGAC GGGCGCGAAC GCCCGCCTCT ATCCAGGATG GGAGAGGGCC GCGTTCTTGA ATCGCACCGG CAGGGTGGTC AAGCTCATCA ACGCCCCCGA CTGGGACACG GATAGGCACG AGGAGGACGA CCAAGTCTTC TTGAACCCTG GAGAGCCCAG CTTCCGGCAG CACCTGATCG AGGAGATCAG ACGCACGGTA GCACGTTATG GGGTTGACGC CGTGTTCCTG GATGCGTCGT CGGCTTGGTT CAACGATCCC AGGTATGACG TCTACGCAGG CTATCGAGAG CTGGTGGCGA GCCTGAGGGA GCGCTTCCCC CACATATTGA TCGCTGGCGA GGGATGGTAC GATGCCCTGC TCGCTCTGCT CCCAATGAAC CAGAGCCTCT TGGGAGTCTC GGTACGCTAT AGGTTGCCAG AGCTACTTAC TCGCTATGCG CGCGTCTTTG GCCACATGGC ACAGGGAGCA CCGGCGTCTA CTTGGCCACA TGGCACAGGG AGCACCGGCG TCTACGAGGA GGGCTTTCTG CCTGTGCGGC ACGAGCAGCC GGAGTTCGGT CACATCCTCA GCGTAAACGT GGTAGAGGAC ACTCTGGATC GGTACTGGGA AGAGATGGCT ACCATCTGTC GCATGGCTCT TCGCAGCGCT TGA
|
Protein sequence | MDVTELSYDP QDPFFSIPGD DRGMAWGVQI FTTGNLYGLA PEHTEVVAEG DGLRVSCNRL SWGGQQGRSE GQVEAVVSVR GRSVTLRISA WHDEPVKAIK LLLRNLPTEV QREGLWHPTL PQGESLWPRS GSPVLWRYPG PEWLTPWACI GEAQGAISIS VRDPEVRAKR LYAHVPPYLH LPEVEVICEE SALRWCSHFT SPEIKISFLS GLADLEADFR AHLEFLEGSY GLRRWEDRQD VPGWARDIKL VVYLHGQHWT GHVFNTFDKM AYVLEQLTRL IAGRHVLCFV PGWSGRYYFS YPLYAPGESL GGSGAFERFI SVAHRLGVKV MPMFGATGAN ARLYPGWERA AFLNRTGRVV KLINAPDWDT DRHEEDDQVF LNPGEPSFRQ HLIEEIRRTV ARYGVDAVFL DASSAWFNDP RYDVYAGYRE LVASLRERFP HILIAGEGWY DALLALLPMN QSLLGVSVRY RLPELLTRYA RVFGHMAQGA PASTWPHGTG STGVYEEGFL PVRHEQPEFG HILSVNVVED TLDRYWEEMA TICRMALRSA
|
| |