Gene Plav_3169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3169 
Symbol 
ID5455164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3381686 
End bp3383194 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content64% 
IMG OID640878758 
Productanthranilate synthase component I 
Protein accessionYP_001414432 
Protein GI154253608 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.276011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.09914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCGC CCGACTACGC CAGTTTCGAG GCGTCGTACA ACGCGGGTAC GGCGCAGGTG 
GTCTATGCGC GTCTCGTCGC GGACCTCGAA ACGCCTGTCT CCGCCATGCT CAAGATCGCG
GACGGAAAGC CGAACAGCTT CCTGCTCGAA TCCGTCGAGG GCGGCGACAG CCGCAACCGC
TATTCGATCA TCGGCTTCGC GCCGGATGCC ATCTGGCGGA CGCGGGGCGA CAAGGCGGAG
CTGAACCGCA AGGCCCGCTT CGGGGATACT TACGAGCCCT GCCCCGGCGG CGCGCTGAAG
AGCCTGCGTG CCTTCATCGA GGAAAGCCGC ATCGACCTGC CGGAGGAGCT GCCGCCGATG
GCCGCCGGCG TCTTCGGCTA TATGAGCTAC GACACGGTGC GCCTGATGGA GCACCTGCCG
AACGAGAACC CGGACGCGCT CGGCTTGCCG GACGGCATCT TCATCCGTCC AACGATCATC
GCCGTCTTCG ATTCCGTGAA GGACGAAGTT ACCGTCGTCA CGCCCGTGCG GCCGGAGCCC
GGCGTCAGCG CACATGCCGC CTATACGCGC GCGAGCGAAC GTGTCGGCGA TGTGATCGCC
GAGTTCGATA CGGCGCTGCC GCATCTCCAC CGCGATGCGG AAATCGGGCC GCTCGAAGAG
CCGTGCTCCA ACACGCCCAA GGATGCCTAT TTCGGCATGG TCGCCCGTGC GAAGGAATAC
ATCGCCGCCG GCGACATTTT TCAGGTCGTG CTCTCGCAGC GTTTCGAGGC GCCGTTCGAG
CTGCCGCCCT TCGCGCTCTA CCGGGCGCTC CGCCGCATCA ACCCGTCGCC CTTCCTTTAT
TTCCTGAATT TCGAGGATTT CTCCATCGTC GGCTCGAGCC CGGAAATTCT CGTCCGCGTG
CGGAACAACC GCGTCACCAT CCGCCCTATC GCGGGCACGC GCCATCGCGG AAAGAACAAG
GCGGAAGACG AGGCGATCTC CGAAGAACTC CTCGCCGATC CGAAGGAGCG CGCCGAGCAC
CTCATGCTGC TCGATCTCGG GCGCAACGAT GTCGGCCGCG TCTCGAAGAT CGGTTCGGTC
GATGTGACGG AACGTTTCGC GCTCCAGTAC ACATCGCACC TCATCCACAT CGTCTCGAAC
GTCGAAGGCG ACCTCGATCC CGCCTATGAC GCGATTTCCG CCCTTGTCGC GGGCTTCCCG
GCAGGCACCG TTTCCGGCGC GCCGAAAGTC CGCGCGATGG AAATCATCGA CGAACTCGAA
CTCGAAAAGC GCGGCCCCTA TGCCGGCTGC GTCGGCTACT TCTCGGCGGC GGGCGAAATG
GACACCTGCA TCGTGCTCCG CACCGCCATC GTCAAGGACG GCAAGATGTA CGTCCAGGCA
GGCGGCGGCG TGGTCGCGGA TTCAAGCCCC GAAGGCGAGT ATCAGGAAAG CGTCAACAAG
GCCAAGGCCC TCTTCCGCGC GGCGGAAGAA GCCGTGCGCT ACGCATCGCA GGTGGGGAAA
AGGCAGTAA
 
Protein sequence
MISPDYASFE ASYNAGTAQV VYARLVADLE TPVSAMLKIA DGKPNSFLLE SVEGGDSRNR 
YSIIGFAPDA IWRTRGDKAE LNRKARFGDT YEPCPGGALK SLRAFIEESR IDLPEELPPM
AAGVFGYMSY DTVRLMEHLP NENPDALGLP DGIFIRPTII AVFDSVKDEV TVVTPVRPEP
GVSAHAAYTR ASERVGDVIA EFDTALPHLH RDAEIGPLEE PCSNTPKDAY FGMVARAKEY
IAAGDIFQVV LSQRFEAPFE LPPFALYRAL RRINPSPFLY FLNFEDFSIV GSSPEILVRV
RNNRVTIRPI AGTRHRGKNK AEDEAISEEL LADPKERAEH LMLLDLGRND VGRVSKIGSV
DVTERFALQY TSHLIHIVSN VEGDLDPAYD AISALVAGFP AGTVSGAPKV RAMEIIDELE
LEKRGPYAGC VGYFSAAGEM DTCIVLRTAI VKDGKMYVQA GGGVVADSSP EGEYQESVNK
AKALFRAAEE AVRYASQVGK RQ