Gene Pnec_0162 details


Gene Information

Locus tag: Pnec_0162
Symbol:
ID: 6184129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 141518
End bp: 143023
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 46%
IMG OID: 641670886
Product: anthranilate synthase component I
Protein accession: YP_001797085
Protein GI: 171462972
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
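
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 141518
end_bp = 143023
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
expected_aa = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop matches protein length:", expected_aa == protein_length_aa)
```

Under these assumptions all three checks hold: 143023 - 141518 + 1 = 1506 bp, and 1506 / 3 - 1 = 501 aa.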


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.487235
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGTG AAGAATTTAA TGCCCTAGCA AAAGAGGGCT TCAATCGTAT TCCCCTCATT 
AAAGAGGTGC TAGCAGATCT TGAAACACCT CTGCCCCTTT ACGTTAAGCT CAGCCAAGCA
TTTGGAAAGA AGAATGCTTA CCTATTGGAG TCCGTTTTAG GCGGTGAGCG TTTCGGCCGC
TTCTCCTTTA TTGGCCTGCC TGCCAGGACA ATCGTGAGAA CTGTTGGAAC ACCTACTGCG
CCAATCAATG AAGTGCTTAC GGACGGAATA ATTGTTGAAA GCAATACCGA CAATCCACTC
GACTTTGTTG ATGCTTATTT CAAACGCTTT AAGGTTGCAC TACAGCCTGA TATGCCTCGT
TTTTGCGGCG GCCTAGCTGG TTACTTTGGT TATGACACTG TTCGTTACAT CGAATCACGT
CTAGCCAATC ATCAACTTCC AGACGAACTT GGCATTCCTG ATATTCAACT CATGTTGACT
GAAGAGTTGG CAGTAATTGA TAACGTTGCA GGAAAAATTT ATTTCATTGT TTATGCAGAC
CCCAACGTTG CCGATAATTT CGAAAGGGCT CAAGAGCGCC TAAAAGAATT AATGGCTTGT
CTTGGTAAGC CAGCAAATAT ACCAGCGTCT TTACCAAGCA CGAAAACAGA ACTCATTCGC
AAATTTAAGG CTGCAGATTT TGAAAATGCA GTCCTTAAAA CCAAAGAATA TATTTTGGCT
GGTGACTGCA TGCAGGTTGT GATTGGTCAA CGCATTAGCA AGCCATTCAC AGACTCGCCC
TTAGCGCTCT ACAGAGCCTT ACGCTCTCTC AATCCATCGC CGTATATGTA TTTCTACGAC
TTTGGCGACA TGCAAATCGT TGGTTCATCT CCCGAGATCT TGGTGCGCCA AGAAAAGCGT
GCTGCAGAGA AAATTGTGAC GATACGTCCG CTTGCCGGAA CTCGTCCCCG TGGAGCAAAT
CCAGAAGAAG ATGAGTGCTT GGCCAAAGAA CTCTTAGCGG ACCCCAAAGA AATCGCTGAA
CACGTCATGC TGATTGATTT AGCCCGAAAT GACGTGGGAC GCATTGCAAA AACGGGCTCA
GTGAAGGTAA CTGACTCCAT GTCTATCGAG AAGTACTCAC ATGTTCAACA TATTGTGAGC
TCGGTAGAAG GTGATCTTTT AGACAACATG AGCAATATGG ACGTATTGCG AGCCACTTTC
CCAGCGGGCA CCTTATCAGG CGCCCCAAAA ATTCGGGCAA TGGAAATCAT TGATGAGATG
GAAATTGTGA AGCGCGGTGT ATATGGTGGC GCAGTTGGCT ATCTTTCATT CTCTGGAGAT
ATGGATGTAG CGATTGCTAT TCGTACAGGC GTGATCCGGG ATGGCATATT GCACTCTCAG
GCAGGTGCAG GTGTTGTAGC CGACTCTGAT CCGACTGCTG AATGGAAAGA AACAGAAGCA
AAAGCACGCG CAGTATTGAC TACCGCAGAT CTAGTACAAG GAGGTCTTGA TGCTCCTAAT
GATTGA
 
Protein sequence
MQREEFNALA KEGFNRIPLI KEVLADLETP LPLYVKLSQA FGKKNAYLLE SVLGGERFGR 
FSFIGLPART IVRTVGTPTA PINEVLTDGI IVESNTDNPL DFVDAYFKRF KVALQPDMPR
FCGGLAGYFG YDTVRYIESR LANHQLPDEL GIPDIQLMLT EELAVIDNVA GKIYFIVYAD
PNVADNFERA QERLKELMAC LGKPANIPAS LPSTKTELIR KFKAADFENA VLKTKEYILA
GDCMQVVIGQ RISKPFTDSP LALYRALRSL NPSPYMYFYD FGDMQIVGSS PEILVRQEKR
AAEKIVTIRP LAGTRPRGAN PEEDECLAKE LLADPKEIAE HVMLIDLARN DVGRIAKTGS
VKVTDSMSIE KYSHVQHIVS SVEGDLLDNM SNMDVLRATF PAGTLSGAPK IRAMEIIDEM
EIVKRGVYGG AVGYLSFSGD MDVAIAIRTG VIRDGILHSQ AGAGVVADSD PTAEWKETEA
KARAVLTTAD LVQGGLDAPN D
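
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the sequence is pasted as shown, in space-separated blocks of ten, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and only serve this example.

```python
from itertools import product

# Codon table in the usual TCAG ordering (standard assignments, shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above, as laid out in the record.
first_block = "ATGCAGCGTG AAGAATTTAA TGCCCTAGCA"
print(translate(first_block))
print(gc_fraction(first_block))
```

Passing the full gene sequence to `translate` in the same way should reproduce the listed protein sequence, and `gc_fraction` gives a value to compare against the GC content field; neither result is asserted here.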