Gene Nmul_A2369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2369 
Symbol 
ID3785306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2695012 
End bp2696493 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content58% 
IMG OID637812458 
Productanthranilate synthase component I 
Protein accessionYP_413050 
Protein GI82703484 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAG CAGAATTCCA TGCTCTGGCG ACAAAGGGCT ACACCCATAT TCCTGTCGTC 
CTGGAAACCT TTGCCGATCT TGACACCCCG TTGTCGGTTT ATCTGAAGCT GGCAAATGAG
CCTTATTCCT ATCTGCTGGA ATCCGTTCAG GGAGGCGAGC GTTTTGGACG TTATTCAGTT
ATCGGCTTAC CCGCCCGCAA CCGAATCGAA GTGCGCGGCA ACGATGTCAG CGTAATCGAC
GGCAGCACCA GCCAGGTATT CCAGTCGGAA GATCCTCTTG CTTTCGTTCA ATCCTATCTG
GCGCGCTTCA AGGCGGCACC TTATCCCGGC TTGCCGCGGT TTTGCGGCGG TCTCGCGGGC
TACTTCGCGT ACGATACGGT ACGTTATATC GAGCGCAGGC TCGCGGGTGG GTGCGCGTAC
AGGCTGCCTG ACACACTGGA TACGCCCGAC ATCCTGCTGC TGGTTTCGGA AGAACTGGCC
GTGGTGGATA ACCTTTCCAG CAAGCTCTAC CTCATCGTAT ATGGGGACGC CGCTCTTGAG
GGCGCGTACT CCAGCGCCCG GAAACGGTTG AAAGAACTGT TGGGATCTCT CCGGGAACCG
TTACGGATTC CATTGGAAAT GCCGTGTGAA TCGGGTGCGC CGGTTTCGCA GTTCGCGGAG
GCGGATTTCA TCTCGGCAGT AGAACGGGCC AAGCGCTATA TTTTTGATGG CGATATCATG
CAGGTGGTGC TATCGCAGCG CACCAGCAAA CCGTACGGCG CATCACCGCT CGCGCTGTAC
CGCGCGTTGC GCAGCCTCAA TCCTTCTCCC TACATGTTTT ATTACCACCT CGGCAGCTTT
TACGTGGTGG GAGCCTCGCC CGAAATCCTG GTGCGACTCG AAGGTGAAAC GGTAACGGTA
CGCCCGATCG CAGGAACCCG TCCCCGGGGC AAGACCTTGG GGGAAGACGC TGCGCTGGCC
ACCGATCTGC TCGCCGACCC GAAAGAGCGG GCGGAGCATG TAATGCTGAT GGACCTGGGG
CGTAATGATG TAGGACGCGT GGCGCAGATC GGAACGGTAA AGGTGACAGA GAACATGCGT
ATCGAGCACT ACTCGCACGT CATGCATATT GTTTCCAATG TCGAAGGCCG GCTCAAGCCC
GGCCTCAATG CGATGGATGT GCTGCGCGCC ACCTTTCCGG CGGGGACCGT AAGTGGCGCG
CCCAAGGTGC GGGCAATGGA AATCATCGAT GAACTGGAAG TCTCGAAGCG TGGAATCTAT
GCAGGTGCAG TCGGATATCT TGGATTCAAC GGGGATATGG ATCTGGCTAT CGCCATCCGC
ACTGGAGTGA TCAAGGACGG CAAACTGCAT GTCCAGGCGG GTGCGGGCAT CGTTGCCGAT
TCCGTTCCGC AAAGCGAATG GGTGGAGACG CAAAACAAGG CCAGGGCCCT GCTGCGCGCG
GCGGAGATTG CGGAGAACGG GTTGGACAGC AGGATCGAAT GA
 
Protein sequence
MTEAEFHALA TKGYTHIPVV LETFADLDTP LSVYLKLANE PYSYLLESVQ GGERFGRYSV 
IGLPARNRIE VRGNDVSVID GSTSQVFQSE DPLAFVQSYL ARFKAAPYPG LPRFCGGLAG
YFAYDTVRYI ERRLAGGCAY RLPDTLDTPD ILLLVSEELA VVDNLSSKLY LIVYGDAALE
GAYSSARKRL KELLGSLREP LRIPLEMPCE SGAPVSQFAE ADFISAVERA KRYIFDGDIM
QVVLSQRTSK PYGASPLALY RALRSLNPSP YMFYYHLGSF YVVGASPEIL VRLEGETVTV
RPIAGTRPRG KTLGEDAALA TDLLADPKER AEHVMLMDLG RNDVGRVAQI GTVKVTENMR
IEHYSHVMHI VSNVEGRLKP GLNAMDVLRA TFPAGTVSGA PKVRAMEIID ELEVSKRGIY
AGAVGYLGFN GDMDLAIAIR TGVIKDGKLH VQAGAGIVAD SVPQSEWVET QNKARALLRA
AEIAENGLDS RIE