Gene Information |
Locus tag | Tgr7_2467 |
Symbol | |
ID | 7317163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2620361 |
End bp | 2621863 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643617371 |
Product | Guanosine-5'-triphosphate,3'-diphosphate diphosphatase |
Protein accession | YP_002514532 |
Protein GI | 220935633 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
|
|
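The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both are assumptions that happen to match the numbers in this record, not documented properties of the database. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
start_bp, end_bp = 2620361, 2621863
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1503 500
```

The result agrees with the Gene Length (1503 bp) and Protein Length (500 aa) fields.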
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCGAC GCCGCCTGAC GCCAGAAACC GTTGCCGCCG TGGACCTGGG GTCCAACAGC TTCCACATGA TCGTCGCGCG GGTGAAGGAC GGTCATGTGC ACATGCTCGA TCGCCTGCGC GAGACCGTGC GCCTGGCGGG CGGCCTGGAT GATCGCAATC AGCTCTCGGA AAAGGCCATG GAACGCGCCC TGGGCTGCCT GGAGCGCTTC GGTCAGCGGG TGCGTGATCT GCCGCCCGGC GCGGTGCGCG CGGTGGGGAC CAACACCCTG CGTAAGGCGC GCAATGCGGC CGAGTTCATG GAGCGGGCCA CCGAGGCGTT GGGGCACCCC ATCGAGGTGA TCGGCGGCCA CGAGGAGGCA CGCCTCATCT ATCTGGGGGT CTCCCACACC CTTGCCGATG ATGGCGGCCA CCGCCTGGTG GTGGACATCG GCGGCGGCAG CACCGAACTC ATTATCGGCG AGCGCTTCGA GCCGGTGTAC GGCGAGAGCC TGTACATGGG CTGCGTGAGT TTCAGCCAGT GGTATTTCCC CGAAGGGGCG ATCACGGAGA CCGGCTGGCG CACCGCGGAC ATCGCTGCGC GACTGGAACT GCAACCGGTG GAGCGTCAGT TCCGGCGCAT GGGTTGGCGC GAGGCCCTGG GTGCCTCGGG CACCCTGCTG GCGGTGGAGC GGGTGCTGCG TGAGCAGGGC TGGAGTACTG GCGGCATCAC CCTGGAGGGG CTCAGGCAAC TGCGCCAGGC CATGCTCAAG GCCGGTCACG TCTCGCGTCT CGAACTCAAC GGCCTGAACG AGGACCGCAA GCCGGTGTTC CCCGGCGGGG TGGCGGTGAT CATGGCCATC TTCGAGGCCC TGGACATCGC CAGCATGCAG GTCTCCGACG GCGCCCTGCG CGAGGGCTTG ATCTATGACC TGCTCGGGCG CATCAGCCAC GAGGACGTGC GCGCGCGCAC CATCGAGGGC CTGATGCGCC GCTTCGGCGT GGACGAGGCC CACGCCCGCC GGGTGGAGAC CACGGCGCGG CAATTGCTCG AATTCGCAGG CGAGGCCTGG AACCTGGACG AGGACGCCGC GGATACCCTG TCGTGGTCCG CGCGGCTGCA TGAACTGGGT CTGGCCATCT CCCATTCCGG CTACCACAAG CACGGTGCCT ACCTGCTGGA GAACGCGGAC CTGGCAGGCT TCTCCCGCCA GGACCAGCGT CAGCTCGCGC TGCTGGTGCT GGCCCACCGG CGCAAGTTCC CCGCCAAGCT CATGGCCCAG GCCCTGCAGG ACGAGGCCCT GGAGCGCATC ACCCGCCTGG CCGTGCTGCT GCGTCTGGCG GTGCTGCTGC ACCGAAGCCG GGGCGAGAAT GCGCCGGCGC TGGAGAGACT CGAGGTCATG CGCAACGGTG TGAAGCTGTG CTTCCCGCCC GGCTGGCTGG AGACCCACCC CATGACCCGG GCGGACCTGG AACGGGAAAA GGGCTACCTG AAATCGGCGG GGATCAAGCT CAAGTTCGAG TGA
|
Protein sequence | MFRRRLTPET VAAVDLGSNS FHMIVARVKD GHVHMLDRLR ETVRLAGGLD DRNQLSEKAM ERALGCLERF GQRVRDLPPG AVRAVGTNTL RKARNAAEFM ERATEALGHP IEVIGGHEEA RLIYLGVSHT LADDGGHRLV VDIGGGSTEL IIGERFEPVY GESLYMGCVS FSQWYFPEGA ITETGWRTAD IAARLELQPV ERQFRRMGWR EALGASGTLL AVERVLREQG WSTGGITLEG LRQLRQAMLK AGHVSRLELN GLNEDRKPVF PGGVAVIMAI FEALDIASMQ VSDGALREGL IYDLLGRISH EDVRARTIEG LMRRFGVDEA HARRVETTAR QLLEFAGEAW NLDEDAADTL SWSARLHELG LAISHSGYHK HGAYLLENAD LAGFSRQDQR QLALLVLAHR RKFPAKLMAQ ALQDEALERI TRLAVLLRLA VLLHRSRGEN APALERLEVM RNGVKLCFPP GWLETHPMTR ADLEREKGYL KSAGIKLKFE
|
| |
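The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names are hypothetical, and the short example string is just the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
example = "ATGTTTCGAC GCCGCCTGAC"
seq = clean_sequence(example)
print(seq, len(seq), gc_fraction(seq))
```

Applied to the full gene sequence, the same functions could be used to compare against the GC content field, bearing in mind that the record reports a rounded percentage.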