Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1541 |
Symbol | |
ID | 7316165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1654545 |
End bp | 1655870 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643616432 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_002513612 |
Protein GI | 220934713 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAAG CACAAGAACT CGCATTCGAG GTCCAATCCG GTGGCACGCT GACCGGGCGC ATCCGGGTGC CCGGGGACAA GTCCATTTCC CATCGTTCCA TCATGCTGGG CTCCCTGGCC GAGGGCACCA CGGAGGTCAC CGGCTTCCTC AATGGTGAGG ACTGCATGGC CACCCTCGCC GCCTTCCGGG CCATGGGCGT GCAGATCGAC GGGCCCAGGG AGGGCAGGGT GACCATTCAG GGCGTCGGCC TGCACGGCCT CAAGGCACCC GCCGGGCCCC TGGATCTGGG CAATTCCGGC ACCTCCATGC GTCTCATGTC CGGCCTGTTG GCGGGGCAGG CCTTCGATAC CACCCTGGTG GGCGATGCCT CCCTGACCAA GCGGCCCATG AAGCGGGTCA CCGAGCCGCT GGCTGCCATG GGCGCCCGGA TCGACACCAG CGCGACCGGC ACGCCACCCC TGCATGTGCA CGGCGGCCAG ACACTCAGGG GCATCGACTA TCAAATGCCC ATGGCCAGCG CCCAGGTGAA GTCCTGCCTG CTGCTCGCTG GCCTCTACGC CGAGGGCAGC ACCTGCGTCA CCGAGCCGGC CCCCACCCGG GACCACACCG AGCGCATGCT CACCGGCTTC GGCTATCCGG TGAGCCGCGA GGGCAACCGG GCCTGCGTCC AGGGCGGCGG GCGGCTCAAG GCCACCCGCA TCGACGTGCC CGCGGACATC TCCTCGGCGG CCTTCTTCCT GGTGGGCGCC AGCATCGCCA AGGGCTCCGA GATCACCCTG GAACACGTGG GCATGAACCC CACCCGGGTG GGCGTGATCG ACATCCTGCG GCTCATGGGC GCCGAGATCC ACGTGGAGAA CCCCCGGGAG GCGGGCGGTG AACCGGTGGC GGATCTGAGA GTGGTGAGCG CGCCGCTTCG GGGCGTGCGC ATCCCCGAGG AACTGGTGCC CCTGGCCATC GACGAATTCC CGGCGCTGTT CATAGCCGCC GCCTGTGCCG AGGGGGAGAC CCTGCTCACC GGGGCCGAGG AACTGCGGGT CAAGGAGAGC GACCGCATCC AGGTGATGGC CGACGGCCTG CTGGCCTGCG GCATCGAGGC CGAGCCCACC CCGGACGGCA TCCGCATCCG CGGCGGCCAG CTGCGCGGCG CCACCGTGGA CAGCCACGGG GATCACCGCA TCGCCATGAG CTTCGCCATG GGGGCCCTGC GCGCCGAGGG GGCCATGCAC ATCCGCAACT GCGCCAACGT GAACACCTCC TTCCCGGGCT TCGTGGAACT CGCCGCAGGG GCAGGGCTCG CGATCACCCA GGGGCCCCAG TCGTGA
|
Protein sequence | MSQAQELAFE VQSGGTLTGR IRVPGDKSIS HRSIMLGSLA EGTTEVTGFL NGEDCMATLA AFRAMGVQID GPREGRVTIQ GVGLHGLKAP AGPLDLGNSG TSMRLMSGLL AGQAFDTTLV GDASLTKRPM KRVTEPLAAM GARIDTSATG TPPLHVHGGQ TLRGIDYQMP MASAQVKSCL LLAGLYAEGS TCVTEPAPTR DHTERMLTGF GYPVSREGNR ACVQGGGRLK ATRIDVPADI SSAAFFLVGA SIAKGSEITL EHVGMNPTRV GVIDILRLMG AEIHVENPRE AGGEPVADLR VVSAPLRGVR IPEELVPLAI DEFPALFIAA ACAEGETLLT GAEELRVKES DRIQVMADGL LACGIEAEPT PDGIRIRGGQ LRGATVDSHG DHRIAMSFAM GALRAEGAMH IRNCANVNTS FPGFVELAAG AGLAITQGPQ S
|
| |