Gene Vapar_2069 details

Gene Information

Locus tag: Vapar_2069
Symbol:
ID: 7972471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 2214098
End bp: 2215654
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 70%
IMG OID: 644792662
Product: extracellular solute-binding protein family 5
Protein accession: YP_002943976
Protein GI: 239815066
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
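
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2214098
end_bp = 2215654

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1557 0 518
```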


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATTGC CAATCCATCT GACGCGCCGC CGTGCGCTCC AACTTTGTGC GGTCGGCCCG 
GCCGTTGCCG CATCGGCGGG CCTGCTGGCG CAGCCGGCGC TCTCGCCGCT GCAGATCGTC
GGCCCCTGGG AATTAGGGGG CCTCGCCCCG GCCAACAGTG GCTACATCTT CACGCGCATG
CAGATTGCTG AAACGCTGAT GGAGGCGCGC GAGGACGGCA CCCCGTTGCC TGGCTTGGCT
GAACGCTGGG GAGTGTCGGC CGATGGGTTG GCATGGCGTT TCACGCTGCG TGCGACAGCG
CGCTTTCACG ACGGCACGCC AGTGACAGCC GCCGCAGTCG TGCGCTGCCT GCAGGCCGCA
CGCGTGGCCC CGGCGCTGCT CAGTCTGGCG CCGATCAAGT CGCTGGACGC CGAGGGTGCA
GGTGTCGTCT TGATCAGGCT GGCGTCGCCC TACGGCGGAT TGCCAGCGCT GCTGGCGCAC
AGCAGCACGA TGGTGCTCGC CCCGGCGAGC TACGGTCCCG ACGGCAGGGT GCGGACCATC
GTTGGCAGCG GCCCCTACCG CGTGGTGTCG CTCGCACCGC CCCAGCATGT GGAGGCGGCT
GCGTTCGACG GCTACGACGG CGCCAGGCCT GCCGTCGAGC GCGTGCGTTA CCTGGCGGCC
GGCCGCGCCG AAACCCGTGC GCTGATGGCC GAGGGCGGAC AGGCCGATCT GGCCTACGGG
CTCGACCCGG CAAGCCTCGT GCGACTGCGC AAGCGCGGCC AGGTTCGCGT CGATACGGTG
ACGTTGCCAC GCACCGTGAT CCTCAAAGTC AATGCAGGTC TGCCGGCCTT GAAAGACCTG
CGCGTGCGGC AGGCGCTCAG CCTGTGCATC GATCGGGCCG GCATTGCGAA GGCCTTGCTG
CGTGACCCCG AGCTGGCGGC AACGCAACTC TTTCCTCCGA CGCTCAAGGC CTGGCACGAC
CCCGCGCTGG CGCCATTGAC CCACGACCCC GCCGCTGCGG CGAGGCTTCT GGCCGAGGCC
GGCTGGCGAC GGGCGGCCGA CGGCTTGCGC GACGCTTCGG GCCAGCCATT ACGCCTGTCG
CTGCGGACCT TTCCCGATCG GCCGGAGTTG CCCGTCATCG CCTCCGCGCT GCAGGAGCAG
TGGCGACAGG CTGGCATCGC GGTGCAGGTC GGCGTCGGCA ATTCGGGCGA CATTCCGCTG
GGCCACCGCG ACGGCAGCCT GCAACTGGGC CTGGCCGCGC GCAACTACGC CACGGTGCCC
GACCCCACGG GCACGCTGAT GCAGGATTTC GGGGCCTCGG GGGGCGACTG GGGCGCGATG
GGCTGGACCA GCGATGCGCT GGTCAAGGCA CTTTCCGAGC TTTCGCTCGG CCCATCGTCG
GCCGAGCGCA CGGCCAGGTT GCGCGCGCAG GTCGCGGCAG TGCTTCAGGC CGAATTGCCG
GTGATCCCGA TCGCCTGGTA CCGCCAACAG GTGGCGGTCA GCCAGCGTGT CGCCGGTGTG
AGCCTCGATC CGCTGGAACG GTCCTACCGG CTTACCGCGA TGAGGTGGAA CGCATGA
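
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any length or composition check. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, exactly as printed (with block spacing).
first_row = "ATGACATTGC CAATCCATCT GACGCGCCGC CGTGCGCTCC AACTTTGTGC GGTCGGCCCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two helpers on the full gene sequence gives a length and GC percentage that can be compared with the Gene Length and GC content values listed under Gene Information.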
 
Protein sequence
MTLPIHLTRR RALQLCAVGP AVAASAGLLA QPALSPLQIV GPWELGGLAP ANSGYIFTRM 
QIAETLMEAR EDGTPLPGLA ERWGVSADGL AWRFTLRATA RFHDGTPVTA AAVVRCLQAA
RVAPALLSLA PIKSLDAEGA GVVLIRLASP YGGLPALLAH SSTMVLAPAS YGPDGRVRTI
VGSGPYRVVS LAPPQHVEAA AFDGYDGARP AVERVRYLAA GRAETRALMA EGGQADLAYG
LDPASLVRLR KRGQVRVDTV TLPRTVILKV NAGLPALKDL RVRQALSLCI DRAGIAKALL
RDPELAATQL FPPTLKAWHD PALAPLTHDP AAAARLLAEA GWRRAADGLR DASGQPLRLS
LRTFPDRPEL PVIASALQEQ WRQAGIAVQV GVGNSGDIPL GHRDGSLQLG LAARNYATVP
DPTGTLMQDF GASGGDWGAM GWTSDALVKA LSELSLGPSS AERTARLRAQ VAAVLQAELP
VIPIAWYRQQ VAVSQRVAGV SLDPLERSYR LTAMRWNA
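
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds the standard codon table from the usual compact NCBI ordering and translates the first 30 bases of the gene. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. It does not handle alternative start codons, which is not an issue here because this gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codons in TCAG order (last position varying fastest), with the matching
# amino acids of the standard genetic code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGACATTGC CAATCCATCT GACGCGCCGC"))  # prints: MTLPIHLTRR
```

The result matches the first ten residues of the protein sequence, and the final four codons of the gene (TGG AAC GCA TGA) translate to WNA followed by a stop, matching its last three residues.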