Gene Plav_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1564 
Symbol 
ID5453446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1702112 
End bp1703521 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content63% 
IMG OID640877137 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001412840 
Protein GI154252016 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.328614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGAGA TCGGATGGCG CGCGCCGCAT GAACTCTTTC ATGCGCTGGC GGCGGAGCCT 
TTCGCCCTGC TTCTCGACAG CGCGACAGCC GGGCCGGACC GCGACCCTGC ATTGCATGGC
CGCTGGAGCT TTATCGCATT CGATCCCTTC GACCGGCTGG TTCTGCGTCC AGGCGAGATC
GCCAATCCTT TCGCGGCGCT GAAGACCAAG CTCGCGGGTT TCGCCGCACA GGCGCCCTCG
CCCTCCTTGC CGCCGTTTTC CGGCGGGGCA GCCGGTTTCT TCGGCTACGG GCTCGGACGC
ACGCTGGAGC GCCTGCCGCC TGAGGAAAAG CCCTTCGCGA TCGACGACCA GCACCTTCCG
GACATGGCGC TCGGTTTCTA CGACGTCGTC CTGGCTTTCG ATCTTTGCGC ACGTCGCGCC
TTCATCATCT CATCCGGTTT GCCGGGAGGC GAACCGGACC GGGCAAAACG GCGCCTGACC
GAAATCCGCT CCCGCATCGA GGAACTGCCG AGCATTAGCC GCGCGGCGTC GGCCGGTACT
GGGCAAATAG TCTCGAATTT CTCGCGGCAG GCTTATGAAA AAGCCGTGTC GCGCGTCATC
GACTACATCC ATGCCGGGGA TATTTTTCAG GCAAACCTCT CGCAACGCTT CGAGGCACGG
CTGTCGCCGG AGGATGATGC TTATTCGCTC TACCTGCGGC TGCGAGCCGC AAGCCCCGCG
CCTTTCTCAT CCTTCTTCAA TTTTGGAGAG GGAGCGATCG TGTCTTCGTC GCCGGAACGC
TTCCTCGCCT GCAGCGGCGG CGCCGTCGAG ACAAAGCCCA TCAAGGGCAC GCGGCCGCGC
GGACAAACAC CGGAGGAAGA CCGCCGCCTC GCGGCCGAAC TCCTGAAATC GGAGAAAGAT
CGCGCGGAAA ACGTGATGAT CGTCGATCTC TTGCGGCATG ATATATCCCG CGTTTGCGCC
GATCATTCCG TGGTTGTCGA GAAGCTCTGC GAACTCGAAA GCTTTGCCAA TGTGCATCAT
CTCGTATCGA CGGTGCGGGG ACAATTGAGA GAGGGCGAAA CGCCGGCCGA TCTCCTTGCG
GCCTGCTTCC CTGGCGGCTC CGTCACCGGC GCACCGAAAA AACGCGCGAT GGAGATCATC
GCGGAGCTTG AGCCGACGAC GCGCGGACCT TATTGCGGCG CCGTCGGTTA TCTCGGCGCG
AATGGCGGCA TGGACACCGC CATCTCGATC CGCACCTTGG TCGTGAAGGG AGACCGCGTT
ACCTTTCAGG CCGGAGGAGG AATTGTGGCG GATTCCGATC CCGCCTCGGA GTATGAGGAA
ACACTTGCCA AGGCGCGCGA TATGCGGCGC GCGCTCGGCG CCGCACCCGA AGAAGCTGAA
AGCCCCGCCT TGTCCGGAGC GCATGCATGA
 
Protein sequence
MTEIGWRAPH ELFHALAAEP FALLLDSATA GPDRDPALHG RWSFIAFDPF DRLVLRPGEI 
ANPFAALKTK LAGFAAQAPS PSLPPFSGGA AGFFGYGLGR TLERLPPEEK PFAIDDQHLP
DMALGFYDVV LAFDLCARRA FIISSGLPGG EPDRAKRRLT EIRSRIEELP SISRAASAGT
GQIVSNFSRQ AYEKAVSRVI DYIHAGDIFQ ANLSQRFEAR LSPEDDAYSL YLRLRAASPA
PFSSFFNFGE GAIVSSSPER FLACSGGAVE TKPIKGTRPR GQTPEEDRRL AAELLKSEKD
RAENVMIVDL LRHDISRVCA DHSVVVEKLC ELESFANVHH LVSTVRGQLR EGETPADLLA
ACFPGGSVTG APKKRAMEII AELEPTTRGP YCGAVGYLGA NGGMDTAISI RTLVVKGDRV
TFQAGGGIVA DSDPASEYEE TLAKARDMRR ALGAAPEEAE SPALSGAHA