Gene Vapar_5806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_5806 
Symbol 
ID7974927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012792 
Strand
Start bp510914 
End bp513112 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content69% 
IMG OID644796387 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_002947661 
Protein GI239820476 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
[TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00431194 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACGC TGTTGATCGA CAACTACGAC TCGTTCACCT ACAACCTCTA CCAGTACCTG 
GCGGCCTGCA ACGGCGAGGC CCCGATCGTG CTGCGCAACA ACGAGATGCA CTGGGACGAG
GTCGAGCGCA TGCACTTCGA CAACATCGTG ATCTCGCCGG GCCCCGGACG TCCGCAGCGG
CGCGCCGACC TCGGCATCTC GGCGGACGCA CTGCAGCGTT CGCGCAAACC CGTGCTCGGT
GTGTGCCTGG GCCACCAGGC CATCGCCCAC CATTGCGGCG CCCGTGTCGA CCTGGCGGTG
CGCCCGATGC ACGGACGCCT GGACCGCATC TCGCACGCGC AGCGCGATCT CTTCGCCGGC
ATCCCGGACC GCTTCGAGGC GGTGCGCTAC CACTCGCTCG CAGTCACGCA GCTGCCCTCA
TCCCTGGAGC CGCTCGCCTG GACGTCCGAC GGCACATTGA TGGGACTGCG TCACGTCTCC
CTGCCCTGGT GGGGGGTGCA GTTCCATCCG GAGTCGATCT GCACCGAATT CGGCGCACGG
CTGCTGCGCA ACTTCAGGGA CCTGACGCTG GAGTGGCTCG CACAGACGCC CCGGCGCGAG
GCACGGGATC GCTTCGGACC CGCGCTGCGT TTCGAAGGCC CGCAGGCGTT GTCGCTGCAC
GCGGAGTGCA TCGACACCGC AGCCGATGCC GAGGCCGTGT TCATGCGCAC GTTCGCACAG
CGCGACTGCG TTTTCTGGCT GGACAGCAGC CTCGCCGAGC CCGGGCGGGC GCGCTTTTCC
TTCATGGGCG ACGCGAGCGG CCCGCGTGCC CAGGTGCTGA ACTACCGCAC CGCCCCGCGC
GAACTGATGG TGCGCCGGGG CGACGTCGTC CAGGTGCGCG ACGAGAGCAT CTTCGACTAC
CTGCGCGACC ACCTGCCGGG CCAGCCGCCT GCGGGTCCCC CGCTCCCCTT CGACTTCGCG
TGCGGCTTCG TGGGCTACTT CGGCTATGAG CTCAAGCGGG AGCTGGACGG CACGCATGCC
CACGACGCAG ACACCCCGGA CGCGATGTGG ATCCTCGCCG ATCGCTTCCT CGCCTTCGAT
CACGCCGAGC AACAGGTCTG GATCGTCTGC TTGGATGAGG CCGTGCCGTC CAATGAGAAT
CGCCGCTGGA TGCAGACCAT GCGCGCGCGG CTGGCAGGCG CACCGGCGAC CGCGCCGGAC
GACAGCGCGG CCGCGGCCCG CGATCTGCAA CCATTGGCCT GGCGGATCGA CCTCGCGATC
TACCGGGAGC TGATCCTGCG CTGCCAGCAC GAGATCCTGC AGGGCGAGAC CTACGAGGTC
TGCCTCACGA ACCAACTGGT CGGCGAAGGG CGCGTCCATC CGCTGGAGAT CTACCGGATC
CTGCGCAGGC ACAACCCTGC CCCCTATGCG GCCTACCTGC GCATCGGCGG CTATGCCGTG
CTGTGCGCCT CGCCCGAACT CTTCCTGCAC ATCTCCAGCG CCGGCGATGT GGAATCGAAG
CCCATCAAGG GCACCGCGGC GCGCGGCGCG ACGCCGGCCG AGGACCAGGC GATCGCGGAC
GCCATGGTCG CCGACGAGAA GACCCGCGCG GAGAACCTGA TGATCGTCGA CTTGCTGCGC
AACGACCTCA ACCGGGTCTG CGAGATCGAC AGCGTTCACG TGCCGAGCCT GTTCGCGGTC
GAGACCTATG CCACCGTGCA CCAGCTGGTC AGCACGATCC GGGGGCGGCT GCGCGCGGAC
CTCGGAGCGG TCGACTGCAT CCGCGCCGCG TTCCCCGGCG GCTCCATGAC GGGTGCGCCG
AAGGTGCGCA CGATGAAGAT CCTGGACGAA CTCGAGGCGG CGGCCCGCGG CATCTACTCG
GGCAGCATCG GCTACCTATC GCTCAACGGC GCGGCGCAGC TCAACATCGT GATCCGCACC
CTGGTGTCGG CCAACGGCCG CGTCTCCATC GGCGCCGGTG GCGCCATCGT CGCCATGTCC
GATCCTGACG CCGAGGTCGA GGAGATCGTT CTGAAGGCGC AGGCGCTGTG GGACGTCCTC
AGGCGTTGCG GCGCACCATT CGGGGCGCGG CCGGGGACGG CGCCGGATGC CCGTGCCGCG
GGCGATCCCG GGGTTCCGGG GCCTCCGGGG TCGCCGTCCG CGCCCGAGAT TCCGCAAAGC
CCTTCTACGC GGTCACCGCC TGCGTCCGGC AATGGCTAG
 
Protein sequence
MRTLLIDNYD SFTYNLYQYL AACNGEAPIV LRNNEMHWDE VERMHFDNIV ISPGPGRPQR 
RADLGISADA LQRSRKPVLG VCLGHQAIAH HCGARVDLAV RPMHGRLDRI SHAQRDLFAG
IPDRFEAVRY HSLAVTQLPS SLEPLAWTSD GTLMGLRHVS LPWWGVQFHP ESICTEFGAR
LLRNFRDLTL EWLAQTPRRE ARDRFGPALR FEGPQALSLH AECIDTAADA EAVFMRTFAQ
RDCVFWLDSS LAEPGRARFS FMGDASGPRA QVLNYRTAPR ELMVRRGDVV QVRDESIFDY
LRDHLPGQPP AGPPLPFDFA CGFVGYFGYE LKRELDGTHA HDADTPDAMW ILADRFLAFD
HAEQQVWIVC LDEAVPSNEN RRWMQTMRAR LAGAPATAPD DSAAAARDLQ PLAWRIDLAI
YRELILRCQH EILQGETYEV CLTNQLVGEG RVHPLEIYRI LRRHNPAPYA AYLRIGGYAV
LCASPELFLH ISSAGDVESK PIKGTAARGA TPAEDQAIAD AMVADEKTRA ENLMIVDLLR
NDLNRVCEID SVHVPSLFAV ETYATVHQLV STIRGRLRAD LGAVDCIRAA FPGGSMTGAP
KVRTMKILDE LEAAARGIYS GSIGYLSLNG AAQLNIVIRT LVSANGRVSI GAGGAIVAMS
DPDAEVEEIV LKAQALWDVL RRCGAPFGAR PGTAPDARAA GDPGVPGPPG SPSAPEIPQS
PSTRSPPASG NG