Gene Vapar_3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3062 
Symbol 
ID7973782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp3218467 
End bp3220359 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content72% 
IMG OID644793646 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_002944947 
Protein GI239816037 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCTTCGT TCGCGCTCCT CGACGACTGC GGCTCGACGC CCGCGCGGCC CACCAGCCGG 
CTGCACACCG GCTTCGTGCG CGAGCACCGC TGCGCGGATC CGGCCGAGCT CGATGCGGCC
TGGGCGCGCG TCGATGCCGA CCTGCGGCAG GGCCTGCACG CGGTGCTGCT GGCCGACTAC
GAATGGGGCG CCAGGCTGCA GCGCGCGGGG CATGCACGGC TTGCGACCGA CGATGCCTCG
GCGCTGCGCG TGCTGATGTT CCGCGACTGC GCCATGCTTT CGGCGGACGA GGTTGCCAGC
TGGCTGGAGC AGCTCGAGGC GAGCGAGGCC AACCCGCCGC AGCCCGTCGG CGTGATGGAC
CTGCTGCCCA GCATCGACGC GCCGGCCTTC ACCGAGGCCA TCGCGCGCAT TCACGGCGCC
ATCGCCGCGG GCGAGACCTA CCAGGTCAAC TTCACCTACC GGCTGCACGG CCAGGGCTAT
GGCTCGCCCG TGGCGCTGTA CCGGCGCCTG CGCGAACGCC AGCCCGTGGC CTACGGCGCG
CTGATCGCCC TGCCCGAAAG CAATTCAAAG GCGGGGGAGG GCGAGGCCAC CCACGTGCTC
TCGTGCTCGC CCGAACTCTT CCTGCGCCAC GATGCGGGCG TGCTCACCGC GCGGCCGATG
AAGGGCACCG CGGCCCGCGC CGATGCGCCC GAGGGCGACA GCGAAAGGGC GCGCCTGCTC
TCGGTCGACA CCAAGAACCG CGCCGAGAAC GTGATGATCG TCGACCTGCT GCGCAACGAC
ATCGGCCGCG TGGCGCAGAT CGGTTCCGTC AAGGTGCCGA CGCTCTTTGC CATCGAGCCC
TATGCCACGG TGTTCCAGAT GACATCGACC GTAGAGGCGC GGCTGCGCGC CGGCGTCGGC
ATGCCCGAAC TGCTGCGCGC GGTGTTCCCC TGCGGCTCCA TCACGGGTGC GCCCAAGCAC
CGCACCATGG AATGGATCGG CGAACTCGAG AGCACGCCGC GCGGCCTCTA CTGCGGCGCG
ATCGGCTGGA TCGATGCGCC CGCGGAGGGC GCGGCCATCG GCGACTTCTG CCTGTCGGTC
GCCATCCGCA CCCTCACGCT GGGCGCGCAG CAGAAGGGCC TGCGGCCGCT GCGGCTGGGC
ATTGGCGCGG GCATCGTGCA GGACAGCGTT GCGGCCGACG AATTCGAGGA GTGCCTGCTC
AAGGCACGCT TTCTCACCGC ACTCGATCCG GGCTTCGAGC TGTTCGAGAC CATGCTGGCC
ACGCCCGGTG AGGGCATCCG CTACCTGGAC CGGCACCTGG CGCGGCTGGC GCACAGCGCA
CGCGCGCTGG GCTTCCGGCT CGACCGCGAT GCCGCGATGC AGGCGCTGCG GGAGGCCTTG
CCCGCGCTCG CACCGGGCCA GCCTTCGCGC CTGCGGCTCG CGCTGGCGCA CGGCGGGCGC
ATCGGCATCA CGCATTCGCC GCTGCCGCCC TTGCCGCCGG GCGCAGTGAA GCTGCTGATC
GCCGACCAGC GGCTGCCCAA TGCGAACCCG CTGGCGGCCC ACAAGACCAC CGTGCGCCAG
CACTTCGACG CCGGCGTGCG CGCGGCCGAA CGCGCCGGCG CCTTCGACAG CCTGTTCTTC
ACCGAAGACG GCCGACTTGT CGAAGGCGGG CGCAGCACGG TGTTCGCGCG CATCGGCGGA
CGCTGGTGGA CGCCGCCCGT TTCCGATGGC GCGCTGCCCG GCGTGATGCG CGGCGTGCTG
ATCGAAGACT CGATCTGGGA GGCCACCGAA CGCAGCCTGT CCCGTGAAGA CCTGGAGGCC
GCGCAGAAGC TCGTGGTATG CAATGCGCTG CGCGGCGTAC TGCCGGCGCG GCTGGTTCAG
CAAGCGCAGC CGCACGAGGC GGCCGCCGCT TGA
 
Protein sequence
MPSFALLDDC GSTPARPTSR LHTGFVREHR CADPAELDAA WARVDADLRQ GLHAVLLADY 
EWGARLQRAG HARLATDDAS ALRVLMFRDC AMLSADEVAS WLEQLEASEA NPPQPVGVMD
LLPSIDAPAF TEAIARIHGA IAAGETYQVN FTYRLHGQGY GSPVALYRRL RERQPVAYGA
LIALPESNSK AGEGEATHVL SCSPELFLRH DAGVLTARPM KGTAARADAP EGDSERARLL
SVDTKNRAEN VMIVDLLRND IGRVAQIGSV KVPTLFAIEP YATVFQMTST VEARLRAGVG
MPELLRAVFP CGSITGAPKH RTMEWIGELE STPRGLYCGA IGWIDAPAEG AAIGDFCLSV
AIRTLTLGAQ QKGLRPLRLG IGAGIVQDSV AADEFEECLL KARFLTALDP GFELFETMLA
TPGEGIRYLD RHLARLAHSA RALGFRLDRD AAMQALREAL PALAPGQPSR LRLALAHGGR
IGITHSPLPP LPPGAVKLLI ADQRLPNANP LAAHKTTVRQ HFDAGVRAAE RAGAFDSLFF
TEDGRLVEGG RSTVFARIGG RWWTPPVSDG ALPGVMRGVL IEDSIWEATE RSLSREDLEA
AQKLVVCNAL RGVLPARLVQ QAQPHEAAAA