Gene Nmul_A2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2071 
Symbol 
ID3784389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2361809 
End bp2363173 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID637812160 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_412757 
Protein GI229137829 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00742795 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAATTTA TTGACTTGCC GGTAGTCGAG CGTGCTCGGG GTAGCGTTCG CTTGCCTGGC 
TCAAAGAGCA TTTCCAATCG AATTTTGTTA CTTGCCGCGT TGTCGGAGGG GGTAACCGAT
GTTTGCGACC TCCTGGCTTC CGATGATACT GCCCGTATGC TCGATGCCCT CTCCACTCTA
GGTGTATCCA TCCTTCAGAT CGGTAGAGAC CATTATCGCT TGCAGGGAGT GGGGGACCAG
TTTCCTCTTC GTTTACCTAC AACGGAAGCG GATCTGTTTT TAGGGAACGC GGGTACCGTT
TTCCGTCCAT TGACGGCCAT GCTGGCACTT GCTCAAGGGC ACTACCGGCT GTCGGGTGTG
CCGCGAATGC ATGAACGCCC CATTGGCGAC CTGGTCGATG CCCTGCGTCA AGTAGGCGCG
GATATCACCT ATCTCGGGAA GGAAGGTTTC CCGCCGCTCC AGATCAAACC CGGGCGGATC
CATCCGGGGG AAATAACGGT AAGGGGCGAG GTATCGAGTC AGTTCCTCAC TGCCCTGTTG
ATGGTGCTGC CTTTCCTGCG CGCGGAAATG GATGAGTTAC CGGTCATCAC CGTGGCAGGG
GAACTGATTT CGCGACCCTA CATCGATCTT ACCATCGCCT TGATGGCGCG TTTTGGGGTA
CAGGTGGAGC GGGAGGAGTG GCGGCGCTTT ACTGTACCCG CAGACCAGCG CTATCGGAGT
CCGGGTCAGG TATTCGTCGA GGGCGATGCG TCCTCAGCCT CCTATTTTCT TGCGGCCGGT
GCAATCGGGA GAGGGCCCGT ACGCGTCGAA GGCTTAGGGC GTGACAGTGT CCAGGGAGAC
ATTCGCTTTG CCGAGGCGCT CGAGCGAATG GGAGCGGATA TCAGATTTGG CGATAACTGG
ATCGAAGCAA GCGGTCCCGG ACCGGGTGGC TTGCGAGCAA TCGATCTGGA CTGCAACCAT
ATTCCGGATG CGGCGATGAC GCTTGCAGTG ACAGCCTTGT TCGCGCGGGG AAACACCGTT
CTCAGGAATA TCGCGAGCTG GCGGGTAAAG GAAACCGATC GTATTGCAGC CATGGCGCAG
GAATTGCGCA AACTCGGCGC AGAAGTGGAG GCAGGGTCCG ATTTTCTGCA AATCAGCCCG
CCTCGCGGGG AACTGGTGGC GAATGCGGCT ATTGATACCT ATGATGATCA TCGCATGGCA
ATGTGCTTTT CCCTGGTGTC TTTTGGCGCG CCGGTTCGAA TCAATGATCC CCGGTGTGTC
TCAAAGACGT TTCCCGATTA TTTTGAAAAA TTTGCGGCTA TCGCCTATGC CGATCCAGGG
CAGGGTAAAT TCGCTGCCCG GATCGATTCT TCTGATATTT CATGA
 
Protein sequence
MKFIDLPVVE RARGSVRLPG SKSISNRILL LAALSEGVTD VCDLLASDDT ARMLDALSTL 
GVSILQIGRD HYRLQGVGDQ FPLRLPTTEA DLFLGNAGTV FRPLTAMLAL AQGHYRLSGV
PRMHERPIGD LVDALRQVGA DITYLGKEGF PPLQIKPGRI HPGEITVRGE VSSQFLTALL
MVLPFLRAEM DELPVITVAG ELISRPYIDL TIALMARFGV QVEREEWRRF TVPADQRYRS
PGQVFVEGDA SSASYFLAAG AIGRGPVRVE GLGRDSVQGD IRFAEALERM GADIRFGDNW
IEASGPGPGG LRAIDLDCNH IPDAAMTLAV TALFARGNTV LRNIASWRVK ETDRIAAMAQ
ELRKLGAEVE AGSDFLQISP PRGELVANAA IDTYDDHRMA MCFSLVSFGA PVRINDPRCV
SKTFPDYFEK FAAIAYADPG QGKFAARIDS SDIS