Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1866 |
Symbol | |
ID | 7315196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1989786 |
End bp | 1990949 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643616757 |
Product | protein of unknown function DUF1016 |
Protein accession | YP_002513934 |
Protein GI | 220935035 |
COG category | [S] Function unknown |
COG ID | [COG4804] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCC GGACGCCCAA AGCCGCCAAA GACGCCGCCC TGCCTGCCGG CTACGCCGGC ATCCACGGCG GCATCGTGGA ACTGCTGGAC GCCGCGCGCC AGGCGGCGGC GCGCAGCGTC AATGCGCTGA TGACGGCCAG CTATTGGGAA ATCGGCCGCC GCATCGTGGA GGCCGAGCAA CAGGGCAAGC GGCGCGCGGG CTATGGCGAG CAGTTGATTG CCCGGCTGTC CGCCGACCTG ACCGCGCGCT TCGGGCGCGG TTTCAGCCCG GACAATCTGG AGAACATGCG GCGGTTCTTC GCCGCCTACC CCCGGCCCAT GATTTCCGAG GCACTGTCTC GGAAATCAGG CGACGAACCG CCTGCCGAGA TTTCCGAGAC AGTGTCTCGG AAATTCGCCC TGGCCGAGCT GGCGCAGGTG TTCCCGCTGC CGTGGTCGGC CTACGTGCGG CTGCTGGCGG TCAAGGACGA CCACGCCCGC CGGTTCTACG AGGCCGAGGC GTTGCGCGGC GGCTGGAGCG TGCGCCAGCT TGACCGGCAG ATCGGCAGCC AGTTCTACGA GCGCACGGCC TTGTCCAAGG ACAAGGCGGC GATGCTGGTC AAGGGCGCAG CGCCGAGGCC CGAGGATGCC GTCAGGCCCG ACGACGCCAT CAAAGACCCC TACGTGCTGG AGTTCCTGAA CCTCAAGGAC GAGTATTCCG AATCCGATCT GGAGGCCGCG CTGATCCAGC GGCTGGAGGA TTTTCTGCTG GAGCTGGGCG AAGGGTTCAC CTTCGTCGGG CGGCAGCGGC GCTTGCGCAT CGACCAGACC TGGTATCGGG TGGATCTTCT GTTTTTCCAC CGGCGGCTGC GCTGCCTGGT CATCATCGAC TTGAAGCTGG GCAGCCTGTC CCATGCCGAC GTGGGCCAGA TGCTCATGTA TTGCAACTAC GCCAAGGAGC ATTGGGCCTA TCCCGATGAA AACCCGCCCG TGGGGTTGAT CCTGTGCGCC GACAAGGGCC ATGCCCTGGC GCGGTATGCC TTGGAAGGTT TGCCGTCAAA GGTGATGGCG GCGAACTACC GTACCGTGCT GCCGGATGCC GAGCTGTTGC AGAAGGAATT GGAGACTACG CGGCGCTTGC TGGAATCGCG CACGTCGAAG CAGCCCAAGA AACTCCCGCA GTAA
|
Protein sequence | MSARTPKAAK DAALPAGYAG IHGGIVELLD AARQAAARSV NALMTASYWE IGRRIVEAEQ QGKRRAGYGE QLIARLSADL TARFGRGFSP DNLENMRRFF AAYPRPMISE ALSRKSGDEP PAEISETVSR KFALAELAQV FPLPWSAYVR LLAVKDDHAR RFYEAEALRG GWSVRQLDRQ IGSQFYERTA LSKDKAAMLV KGAAPRPEDA VRPDDAIKDP YVLEFLNLKD EYSESDLEAA LIQRLEDFLL ELGEGFTFVG RQRRLRIDQT WYRVDLLFFH RRLRCLVIID LKLGSLSHAD VGQMLMYCNY AKEHWAYPDE NPPVGLILCA DKGHALARYA LEGLPSKVMA ANYRTVLPDA ELLQKELETT RRLLESRTSK QPKKLPQ
|
| |