Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1709 |
Symbol | |
ID | 7315731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1807040 |
End bp | 1809046 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643616600 |
Product | hypothetical protein |
Protein accession | YP_002513778 |
Protein GI | 220934879 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAACG TGTTGTCGGA TAAGCAGTTT CTACGGGAGA TGCCATTCGT CAACACATGC AATGAAACCC TGCTGGCTCT GGGTGAGTGG TGGCGCAAGG TCGCCCTGAT CGGAAAAATC AGCAGCCTGG ACGTGGCGGC CACAATCCTG GACGACATGG ACCTGGCCCG CAACGGGTTC CAGACACTCC AGAAGGAACT CATCGATAAC CTTGTCCGGG AGAATCTGCG CAAGCTGGAC CAGGAACTGG GCGCCCGGGC GCAGTTTGCC ATCGACATCC TGATCCGCAA TCTCTTCGAG CGCACGGCGG ATGTGGGTTT CCTGGCGACG GACGAGGACA TCCGCGAATT CCTGCGTCAC CCGGAGGCCT CGCCGGAGGC CACCGAGGCG ATCGTCGAGC GCCTGCGCGA ATACACCCTC AAATACAGTG TGTATGACGA AATCCTGCTG CTCGACCCCC AGGGTCATGT TCGCGCACAC CTGGACCCCG CCAACCCGGT GGTCCATTCC AGGGATCCGC TGATCAGTGA AACACTCCGC GGCGAGGCCC CCTATGTCGA GACATTCCGC CCCACGGATC TCCTGCCCGG GCGCCGCGCC GGTCTGATCT ACTCGGCACC GGTCACCGAC TCAGACCAGC CCGGTGCCCG GGTCCTCGGT CTGCTCTGTC TGAGCTTTCG TTTTGATGAC GAGATGGCCG GCATCTTTGC CAACCTGGTG AGGCCCGGAG AGTTGGTGGC AATCCTCGAC CGCAACGACC GCGTCATTGC CAGCAGTGAT GAACGGCGCC TGCCCATCGG CACCTCGATC ACCGGTGCCC GTGACGGGCA GCTGTCCAGA CTGGGCCATG GCTCGGACGC TTATCTGGCT CGGAACTGCC AGACCAAGGG CTATCAGGGT TATATGGGGC TGTCCTGGAA AGGGCAGGTG ATGGCGCCGA TGGAATCCGC CTTCAGGACT GGCGAGCAGG GCGGGGAGGG TAGCGATGCC GGGACACACC AGGAGGGTGA ATTCATCTCT CTTGAGTTGC GCAACATCCG CAAGAACGCC GCCCGGGTCA CCGACGACCT GTCCCTGATC GTGCTGAACG GACAGATCGT TGCCGCCAAG CGCGATGCCC ACGAGTTCAT GCCCGTGTTG AGCGAGATCC GGGTGATCGG CAACCGCACC CGGCAGGTCT TCGACAACTC GATTACCCGC CTCTACGCGA CCGTGCTGGA ATCCCTGATG AACGAGGTCC AGTTCCAAGC GTTTCTGGCC GTGGACATCA TGGACCGCAA CCTCTACGAG CGTGCCAACG ACGTTCGCTG GTGGGCGCTG ACCAGCCGTT TCAGGGAGAT CCTGGACCTG ACCACGCGCA CTGAAGATCA ACGGCAGACG CTGACGGAGA TCCTCGCCTA CATCAACAGT CTCTATACCG TTTACACAAA TCTGATCCTG TTCGATGCCA ACCGGGTCAT CGTTGCAGTC TCCAGCCCCG AAGAGCAGCA TCTGCTCGGC ACGATTCTGC CTGCCGAGGG CGGCTTCTCT GAGGCACTCG GGATCCGCGA TTCACAGCGT TATGTGGTAT CGCCGTTTAA TTCCACGCAT CTCTACAAAG ATCGCCCGAC CTATATCTAC ATGACCTCCG TGCGTTCTCC GAAAAGCGGA CGCGCCCTTG GCGGCATTGC AATCGCCTTT GACAGCGAAC CACAGTTCGC TGCGATGCTC GAGGATGCGC TTCCCAGGGA TGATGCGGGC CATATCATCG AAGGCAGCTT CGCCGTCTTC GCGGATCGCC AGGGCAATGT CATCAGCGCC ACCGGTGGAG ACCTGCGCCC CGGTGACCGG ATCGATCTGG AAGGAGATCT GCTCAGTCTC GAGAACGGCA CGCGTCGATC CGCACTGATT GAATATCGCG GCAGAAACTA CGCGGTGGGC GCTGCGGTGT CTCAGGGATA CCGGGAGTAC AAGACCACCA ACGACTACGA CAATGACGTG GTGGCACTGG TGTTCATGTC TGTCTAG
|
Protein sequence | MSNVLSDKQF LREMPFVNTC NETLLALGEW WRKVALIGKI SSLDVAATIL DDMDLARNGF QTLQKELIDN LVRENLRKLD QELGARAQFA IDILIRNLFE RTADVGFLAT DEDIREFLRH PEASPEATEA IVERLREYTL KYSVYDEILL LDPQGHVRAH LDPANPVVHS RDPLISETLR GEAPYVETFR PTDLLPGRRA GLIYSAPVTD SDQPGARVLG LLCLSFRFDD EMAGIFANLV RPGELVAILD RNDRVIASSD ERRLPIGTSI TGARDGQLSR LGHGSDAYLA RNCQTKGYQG YMGLSWKGQV MAPMESAFRT GEQGGEGSDA GTHQEGEFIS LELRNIRKNA ARVTDDLSLI VLNGQIVAAK RDAHEFMPVL SEIRVIGNRT RQVFDNSITR LYATVLESLM NEVQFQAFLA VDIMDRNLYE RANDVRWWAL TSRFREILDL TTRTEDQRQT LTEILAYINS LYTVYTNLIL FDANRVIVAV SSPEEQHLLG TILPAEGGFS EALGIRDSQR YVVSPFNSTH LYKDRPTYIY MTSVRSPKSG RALGGIAIAF DSEPQFAAML EDALPRDDAG HIIEGSFAVF ADRQGNVISA TGGDLRPGDR IDLEGDLLSL ENGTRRSALI EYRGRNYAVG AAVSQGYREY KTTNDYDNDV VALVFMSV
|
| |