Gene Vapar_3874 details

Gene Information

Locus tag: Vapar_3874
Symbol:
ID: 7969731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 4103872
End bp: 4104852
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 72%
IMG OID: 644794460
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_002945754
Protein GI: 239816844
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
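
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon (the record states neither). The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4103872, 4104852)
print(length)                     # 981
print(protein_length_aa(length))  # 326
```

Both values match the Gene Length and Protein Length fields of the record.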


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACC TGCTGTTCTC CAAGGCCTCG CGCCGCCATT GGCTCAAGCA GGGCGCGGCG 
CTCTCGTTCG CAGCCACCGC GCCGCTGGGC GCATGGGCGC AAGCGCAAAC CACGCTGGTG
CTCGGCGACC AGGCCGGCGG CCTGCGCGCG CTGTTCGAGG CCTCGAAGGC GCTCGAGGGC
GTGCCCTTCG CCTACCGCTG GGCCAACTTC CAGGGCGCGG CGCCGCTGTT CGAGGCGCAG
CGCAGCGCGG CCGTCGACAC CGCGGTGGCC GGCGACCTGC CGGTGCTGGC GGCGGCGGTC
GGCCGGACGC CGCTGAAGAT CGTCGCCACG CGCGTCGGCA AGGCCGATGC GCTGGGCATC
GTGGTGCAGC CCGATTCGCC GTTGCGCCAG GTGGCCGACC TGCGCGGCAA GACGGTGATC
GTGTCGTCCG CGCGCGGCAG CATCTCGCAA TACCAGCTCT ACGGCGCGCT CGAGGAAGCC
GGCGTGCGGC GCGACGAGGT CACGGTGAAG TTCGTGCTGC CGACCGATGC GGCCGCGGCC
TTTGCCTCGA AGCAGATCGA TGCCTGGGCT GTGTTCGATC CCTACTACAC GATCGCGCTG
CAGCAGGGCG GGCGCATCCT GCGCGATGGG CGCGGCATCA ACACGGCGCT GGGCTTCATC
ACCGCGAGCG AGCCGTCGCT CGCCGACCCC GCCAAGCGCG CCGCCATCGT GCAGTTTCTG
GACCGGCTGG CGCGCGCGGG CGAATGGGCG CTGGCCACCC CCGAGGCCTA TGCGCAGGCC
TACAGCCAGC TCACGCGCCT GCCGATCGAA TCGGCCCGCA TCATCACGGC GCGCGCCTCG
GTCACGGGCC GGCCAGTGTC CGAAGCCGAC ATTGCCGCGC TGCAGACGGT GGCCGACCGC
TCGGCGCGCG ACGGCATCCT GCCGCTGCGC GTTGACGTGC GCGCCATCAC CGATGCGCAG
CTGTGGAAGC GTCCCGCGTG A
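
The gene sequence is displayed in space-separated blocks of 10 bases, so the blocks have to be joined before any computation on it. The sketch below shows one way to do that and to compute a GC percentage, assuming GC content means the share of G and C bases over the full sequence length (the record does not define it). The helper names are hypothetical, and the example input is only the first two blocks of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_blocks = "ATGACCGACC TGCTGTTCTC"  # first two blocks of the gene sequence
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 55.0
```

Applied to the full sequence, the result can be compared against the GC content field of the record.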
 
Protein sequence
MTDLLFSKAS RRHWLKQGAA LSFAATAPLG AWAQAQTTLV LGDQAGGLRA LFEASKALEG 
VPFAYRWANF QGAAPLFEAQ RSAAVDTAVA GDLPVLAAAV GRTPLKIVAT RVGKADALGI
VVQPDSPLRQ VADLRGKTVI VSSARGSISQ YQLYGALEEA GVRRDEVTVK FVLPTDAAAA
FASKQIDAWA VFDPYYTIAL QQGGRILRDG RGINTALGFI TASEPSLADP AKRAAIVQFL
DRLARAGEWA LATPEAYAQA YSQLTRLPIE SARIITARAS VTGRPVSEAD IAALQTVADR
SARDGILPLR VDVRAITDAQ LWKRPA