Gene Information |
Locus tag | Nmul_A2071 |
Symbol | |
ID | 3784389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2361809 |
End bp | 2363173 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812160 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_412757 |
Protein GI | 229137829 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00742795 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAATTTA TTGACTTGCC GGTAGTCGAG CGTGCTCGGG GTAGCGTTCG CTTGCCTGGC TCAAAGAGCA TTTCCAATCG AATTTTGTTA CTTGCCGCGT TGTCGGAGGG GGTAACCGAT GTTTGCGACC TCCTGGCTTC CGATGATACT GCCCGTATGC TCGATGCCCT CTCCACTCTA GGTGTATCCA TCCTTCAGAT CGGTAGAGAC CATTATCGCT TGCAGGGAGT GGGGGACCAG TTTCCTCTTC GTTTACCTAC AACGGAAGCG GATCTGTTTT TAGGGAACGC GGGTACCGTT TTCCGTCCAT TGACGGCCAT GCTGGCACTT GCTCAAGGGC ACTACCGGCT GTCGGGTGTG CCGCGAATGC ATGAACGCCC CATTGGCGAC CTGGTCGATG CCCTGCGTCA AGTAGGCGCG GATATCACCT ATCTCGGGAA GGAAGGTTTC CCGCCGCTCC AGATCAAACC CGGGCGGATC CATCCGGGGG AAATAACGGT AAGGGGCGAG GTATCGAGTC AGTTCCTCAC TGCCCTGTTG ATGGTGCTGC CTTTCCTGCG CGCGGAAATG GATGAGTTAC CGGTCATCAC CGTGGCAGGG GAACTGATTT CGCGACCCTA CATCGATCTT ACCATCGCCT TGATGGCGCG TTTTGGGGTA CAGGTGGAGC GGGAGGAGTG GCGGCGCTTT ACTGTACCCG CAGACCAGCG CTATCGGAGT CCGGGTCAGG TATTCGTCGA GGGCGATGCG TCCTCAGCCT CCTATTTTCT TGCGGCCGGT GCAATCGGGA GAGGGCCCGT ACGCGTCGAA GGCTTAGGGC GTGACAGTGT CCAGGGAGAC ATTCGCTTTG CCGAGGCGCT CGAGCGAATG GGAGCGGATA TCAGATTTGG CGATAACTGG ATCGAAGCAA GCGGTCCCGG ACCGGGTGGC TTGCGAGCAA TCGATCTGGA CTGCAACCAT ATTCCGGATG CGGCGATGAC GCTTGCAGTG ACAGCCTTGT TCGCGCGGGG AAACACCGTT CTCAGGAATA TCGCGAGCTG GCGGGTAAAG GAAACCGATC GTATTGCAGC CATGGCGCAG GAATTGCGCA AACTCGGCGC AGAAGTGGAG GCAGGGTCCG ATTTTCTGCA AATCAGCCCG CCTCGCGGGG AACTGGTGGC GAATGCGGCT ATTGATACCT ATGATGATCA TCGCATGGCA ATGTGCTTTT CCCTGGTGTC TTTTGGCGCG CCGGTTCGAA TCAATGATCC CCGGTGTGTC TCAAAGACGT TTCCCGATTA TTTTGAAAAA TTTGCGGCTA TCGCCTATGC CGATCCAGGG CAGGGTAAAT TCGCTGCCCG GATCGATTCT TCTGATATTT CATGA
|
Protein sequence | MKFIDLPVVE RARGSVRLPG SKSISNRILL LAALSEGVTD VCDLLASDDT ARMLDALSTL GVSILQIGRD HYRLQGVGDQ FPLRLPTTEA DLFLGNAGTV FRPLTAMLAL AQGHYRLSGV PRMHERPIGD LVDALRQVGA DITYLGKEGF PPLQIKPGRI HPGEITVRGE VSSQFLTALL MVLPFLRAEM DELPVITVAG ELISRPYIDL TIALMARFGV QVEREEWRRF TVPADQRYRS PGQVFVEGDA SSASYFLAAG AIGRGPVRVE GLGRDSVQGD IRFAEALERM GADIRFGDNW IEASGPGPGG LRAIDLDCNH IPDAAMTLAV TALFARGNTV LRNIASWRVK ETDRIAAMAQ ELRKLGAEVE AGSDFLQISP PRGELVANAA IDTYDDHRMA MCFSLVSFGA PVRINDPRCV SKTFPDYFEK FAAIAYADPG QGKFAARIDS SDIS
|
| |
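The length fields in this record can be cross-checked against the coordinates and the sequences. The following is a minimal sketch, not part of the source database: the function names (`gene_length`, `protein_length`, `translate_cds`) are hypothetical, and it assumes the Gene sequence field is already given in coding orientation (it starts with GTG and ends with TGA even though the strand is `-`). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code; the sketch simplifies start-codon handling by reading the first codon as methionine, which holds for an initiating GTG.

```python
# Coordinates copied from the record above.
START_BP = 2361809
END_BP = 2363173

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    return gene_len_bp // 3 - 1


def translate_cds(seq):
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator and is read as 'M'.
    Whitespace in the input (the 10-base grouping used above) is ignored.
    """
    seq = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


print(gene_length(START_BP, END_BP))                    # 1365
print(protein_length(gene_length(START_BP, END_BP)))    # 454
print(translate_cds("GTGAAATTTA TTGACTTGCC GGTAGTCGAG"))  # MKFIDLPVVE
```

The three printed values agree with the Gene Length, Protein Length, and the first ten residues of the Protein sequence fields listed above.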