Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2110 |
Symbol | |
ID | 7318213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2234861 |
End bp | 2236096 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643617004 |
Product | hypothetical protein |
Protein accession | YP_002514177 |
Protein GI | 220935278 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.730509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATA TGAAAGGATC GCAGAGTGAT GAAGCGGAGC AAACTCGTGG CGGGTCCCGC CGGGAGGCAG CCTACTATCC GGATGACGAG ATCAGCTTGA TCGACCTGTG GCGCACGCTG GTCAGAAGAA GGTCCATTAT TGGGTTTACA GTCGTGGGGA GCTTGGTCCT CGGTATTATG TTTTTGATAG TCAAGCCCGA TACGTACGTT TTCGGCAGCA CTATCGAAAT CGGTCATTAT CCAACCGAGG CAGGCTCCAG AGAGTTGGCG TCCCTCGATA CGGTCGAGAA TGCCCTGTCC AAGCTTCAGG ATGGCTATAT TCCTGAGGCG CTTCGTCGCT ACATGGATGA AGGCCACGAA GAATCCAGAC TGCGTGTCAC GGCTCGCAAC CCAAGGGGTA GCAATCTGGT TGTGATCGAG GTCAAGGGAA CGCTGGATGA GGGAGAGTCA ATCCTCCGTG TGCTCAACTA CGCCAGTAAT CAACTGATCG AGGATCATCG CTATCTGTTG GATATTCAGC GCCAGCAACT CGATTCCCGT CTTGAGCAGG CCCGGTTGGA ACTTGCGAAG TATGAGGATG AGCGCATGTT CCGTATTCAG GAATTGGAAC GTGTACGGGC CATTGAAAGA ACCAAGCTAG AGCTGAGTGA ACTCCTCCAG CAACAGGAAC TGGTGGAGTC ACGCATCAAG CGCTTGGATG ATATGCAAAG ACTGCTGGAA CAGCAGATCA AGGAGATTCG GCATTCTATC GAGGTGGCAT CTGAAACGAG GCGTGCGGCG GTCGCCAATG TTGGCGACCC CGCGAATGCC ATGACCTTGC TCATGATCGA CAGCGACATC CAGCAGAACC GCAATCGCCT TGCGACCCTG GACGAAAGGC TGCACATCGG CCTGCCCAAT GAATTCGATG AACTTCAGAA GCGACTGGAC GACATTCTGC GGGCTCAGTC GAACATGAAA ACGGAGATTG AGGCCAGGGA GTCTGAACTG GAACGCTTCC GCATTGACTG GGAAAGGTCG ACGGAGGCGC AGCGTCAGAC CGTGCGCGCG GTGGAGGCCA GACTCAATGC CGCCCGGGAT ACGCGAGTCG TGACCCAGCC CGCGCGCAGT CTTGAGCCCG AAGGGCCTGG CAAGTCCATC ATCTTGGCGT TATCCCTGTT CTTGGGCCTG ATGCTGGGCG TCTTCGCCGC CTTCTTCGCC GAATTTCTGA GAAACGCCCG CGAGGAAACA GCATAG
|
Protein sequence | MADMKGSQSD EAEQTRGGSR REAAYYPDDE ISLIDLWRTL VRRRSIIGFT VVGSLVLGIM FLIVKPDTYV FGSTIEIGHY PTEAGSRELA SLDTVENALS KLQDGYIPEA LRRYMDEGHE ESRLRVTARN PRGSNLVVIE VKGTLDEGES ILRVLNYASN QLIEDHRYLL DIQRQQLDSR LEQARLELAK YEDERMFRIQ ELERVRAIER TKLELSELLQ QQELVESRIK RLDDMQRLLE QQIKEIRHSI EVASETRRAA VANVGDPANA MTLLMIDSDI QQNRNRLATL DERLHIGLPN EFDELQKRLD DILRAQSNMK TEIEARESEL ERFRIDWERS TEAQRQTVRA VEARLNAARD TRVVTQPARS LEPEGPGKSI ILALSLFLGL MLGVFAAFFA EFLRNAREET A
|
| |