Gene Nwi_0951 details


Gene Information

Locus tag: Nwi_0951
Symbol:
ID: 3674968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 1043589
End bp: 1045706
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 60%
IMG OID: 637712500
Product: para-aminobenzoate synthase, component I
Protein accession: YP_317565
Protein GI: 75675144
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
        [COG0512] Anthranilate/para-aminobenzoate synthases component II
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
            [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
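
The coordinates and lengths listed above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions, not statements from the record, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1043589, 1045706)
print(length)                  # 2118
print(protein_length(length))  # 705
```

Under these assumptions, 2118 bp corresponds to 706 codons, of which 705 encode amino acids.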


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.754558
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.679604
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGACGC TGATAATCGA CAACTACGAT TCATTCACCT ACAACTTGGT GCACTTGATC 
GCAGATATAA ACCAGGAAGA TCCCCTGGTC GTCCGCAACG ATGCCTTGTC ATGGGACGAG
CTCTCCGGAG AACGATTCGA TAATATCGTT ATCTCGCCTG GTCCGGGACG ACCCGACCGG
GCAGCGGATT TCGGCTTATC GAAGGCTGCG ATCGAATCGG CCACTGCTCC CTTGCTTGGC
GTGTGTTTAG GACATCAGGG TCTCGCCCTC GCGGCAGGCG CGTCGTTGGC GCGGGCTCCC
TCGCTCATCC ACGGGCACAC ATCCAGAATT GCGCATCAGG GATCCGGCCT TTTCGCCAAC
ATGCCGCCCT GGTTCAACGC GGCCCGCTAT CACTCTTTTG TAGTCCAGCG CCCCCTTCCC
TCCACTCTTG AGGAAATCGC CTGGACCGAG GATGGCCTCA TCATGGGCAT CGCCCGTCAC
GGCCGCCCGC AATGGGGCGT TCAATTTCAT CCGGAATCAT TTCTCACCGA GGGCGGCAAG
GTCCTGCTAC GAAACTTTCG CGATCTTTCG TTACGCTTCA CAGGAAGAAC CTCAACACCG
CCGCCACCAC GGACGAACAA GCGGCGCGCC GCGCCGCCTG TTGCGACGGC CCGCAAAGCA
TTCTGGTTCG AGATCCCGCG CGCGATCGAC ACCGAGGCGG TTTTCTGTTC ACTCTTTGCT
GATCAGCCCT TCGCCTTCTG GCTCGACAGC AATCTTGCCG GATCCCCGCT GCCGCAATGG
TCATACATGG GAGACGCATC AGGACCCCAT GCGGCGACGG CGCAGTATCG CAGCATCGAC
CGGCAGATCA TTATCGAGGA TGCTCACGGC CGCCGAACAG AGAACGCCGA TCTATTCGAG
TACCTTCAAA GGGAGCAACT GGGCCGGCCG CAATCGGCGC CGCCCTGTCC TTTCGCTGGC
GGTCATGTCG GATGGTTTAG TTACGAGCTA CGACATGATT GCGGTTCGCC GACCACGCGG
CGGGCTGCAA CACCCGACGC CTTATGGATC CGCCCGGACA GGTTCATCGC CGTCAACCAT
CGTGATGGGA CAAGTTACCT ATGCGCGATC GACAGTCCTG AGGAAGCCGC TCGCGCGCAG
CACTGGATTC GCTCCACGCT CAAGCGGATC GAATCGGCGA GACGCCCGCC CATGAATTCT
CCCGACGCGA TCGACCCTAA CCCACTCTCG TTCCTCATGA AGGATGGAAG ATCGGACTAT
CTTTCCAAGG TCACCCGTTG TCTCGACCTG ATCGCGCAAG GGGAGACTTA TCAGGTCTGT
TTGACCAGCG AGTTGTCCTG CTCTGCCACG GTCGAACCGC TGCGGGTTTA TCGCGCAATG
AGGAGCGTCA ATCCCGCCCC GTTTGCTGCT TTCATCAAAT GGCCCGGCGG AGCCATCCTC
AGCGCCTCGC CGGAGCGATT CCTTGCGGTC GATACCGAGG GCAACATCGA AGCAAAGCCG
ATCAAGGGAA CGATCCGGCG CGCCACCGAT CCGGTGGAGG ACAAGAAGCT GGCCGAAATG
CTGCGGGCCG ATCGGAAGAA CCGGGCCGAG AACGGAATGA TCGTCGATCT CCTGCGCCAC
GATCTGTCGC GCTGTTGCGA GACCGGAACC GTATCAGTCT CGCGCCTGTT TGACGTCGAA
ACATATCAGA CCGTCCACCA ACTGGTGAGT ACGATACGCG GCGTTCTGAA GCCGGGGCAT
ACGCTCATCG ATGTGCTGCG CGGCGGCGCG TTTCCGGGCG GATCAATGAC GGGAGCGCCG
AAATTCCGCA CTGTGGAACT TATCGATCAG TTGGAGCAGC GCGCCCGAGG CATCTATTCC
GGCTCACTTG GCTGGCTCGG CGACGACGGC GCTGCTGATC TAAGCATCGT GATCCGATCC
ATCGTACTGG CTGATGGGCG GCTCTCCATC GGCGTGGGAG GCGGCGTGGT CGCGGAATCC
ACGCCCGAGG GAGAATTTGA GGAGATGCTC CTGAAGGCCG AGGCTTCGAT CAAATCGATC
GTCCTCGCCA CCTTTGGAAG CTTCAGCGAA AGTCAGTATC GCCTGGTGGA ATCCGGCGAT
CGCATGGCTG AAGGCTGA
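
The record reports a GC content of 60% but does not say how it was calculated. A minimal sketch follows, assuming it is the share of G and C among all nucleotides of the gene sequence; the helper name `gc_content` is illustrative, and whitespace between the 10-base blocks is ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are skipped, so the sequence can be
    pasted in 10-base blocks exactly as it is displayed.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; paste the full sequence to compare
# the result with the value reported in the record.
first_row = "GTGAGGACGC TGATAATCGA CAACTACGAT TCATTCACCT ACAACTTGGT GCACTTGATC"
print(round(gc_content(first_row), 1))  # 45.0
```

The figure for a single row can differ considerably from the whole-gene value, so only the full sequence is comparable with the reported 60%.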
 
Protein sequence
MRTLIIDNYD SFTYNLVHLI ADINQEDPLV VRNDALSWDE LSGERFDNIV ISPGPGRPDR 
AADFGLSKAA IESATAPLLG VCLGHQGLAL AAGASLARAP SLIHGHTSRI AHQGSGLFAN
MPPWFNAARY HSFVVQRPLP STLEEIAWTE DGLIMGIARH GRPQWGVQFH PESFLTEGGK
VLLRNFRDLS LRFTGRTSTP PPPRTNKRRA APPVATARKA FWFEIPRAID TEAVFCSLFA
DQPFAFWLDS NLAGSPLPQW SYMGDASGPH AATAQYRSID RQIIIEDAHG RRTENADLFE
YLQREQLGRP QSAPPCPFAG GHVGWFSYEL RHDCGSPTTR RAATPDALWI RPDRFIAVNH
RDGTSYLCAI DSPEEAARAQ HWIRSTLKRI ESARRPPMNS PDAIDPNPLS FLMKDGRSDY
LSKVTRCLDL IAQGETYQVC LTSELSCSAT VEPLRVYRAM RSVNPAPFAA FIKWPGGAIL
SASPERFLAV DTEGNIEAKP IKGTIRRATD PVEDKKLAEM LRADRKNRAE NGMIVDLLRH
DLSRCCETGT VSVSRLFDVE TYQTVHQLVS TIRGVLKPGH TLIDVLRGGA FPGGSMTGAP
KFRTVELIDQ LEQRARGIYS GSLGWLGDDG AADLSIVIRS IVLADGRLSI GVGGGVVAES
TPEGEFEEML LKAEASIKSI VLATFGSFSE SQYRLVESGD RMAEG
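
The protein sequence can be related to the gene sequence through translation table 11, which the record lists for this gene. The sketch below is illustrative only: it assumes table 11 uses the standard amino-acid assignments, that an alternative initiator such as GTG at the first position is read as methionine, and that translation stops at the first stop codon. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the last base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace is removed first. If is_cds_start is true and the first codon
    is a table 11 initiator, it is emitted as 'M'. Codons containing
    characters other than A/C/G/T raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("GTGAGGACGC TGATAATCGA CAACTACGAT"))  # MRTLIIDNYD
```

The output matches the first ten residues of the protein sequence, with the GTG initiator read as M.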