Gene Vapar_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_2519 
SymbolproX 
ID7969996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp2659361 
End bp2660371 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content67% 
IMG OID644793106 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_002944411 
Protein GI239815501 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.395168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA GAACCCGTCT CATCCTTGCC CGCGGCCTGC TGGCCCTCGG TTTCGCCGCC 
TTCGGCATGG GCGCACAGGC CGCCAATGAC CTGCCCGGCC AGGGCGTCAC CGTGCAGCCG
CTCAAGAGCT CGCTCGCCGA AGAGGCGTTC CAGACGCTGC TCGTGATGCG CGCGCTCGAG
AAGCTCGGCT ACACGGTGGA GCCCATGAAG GACCTCGAAC CCGCCACCGA GCACCTGGCC
ATCGCCAATG GCGACGCCAC CTTCATGGCC AACCACTGGA GCCTGCTGCA CGCCGACTTC
TACAAGAACA GCGGCGGCGA CGCCAAGCTG TGGCGCAAGG GCGTGTACTC GGACGGCGCG
GTGCAGGGCT ACCTGATCGA CCGCAAGACG GCCGAGCAGT ACAACATCCG CAGCATCGCG
CAGCTGAAAG ACCCGGCCAT CGCCAGGCTG TTCGATGCCG ACGGCGACGG CAAGGCCGAC
CTGACGGGCT GCAACCCCGG CTGGGGCTGC GAACTGGCGA TCGAGAACCA CCTGACGGCC
TACCAGCTGC GCGACACCGT CACGCACAAG CAGGGCAGCT ATGCCGCGCT GATGGCCGAC
ACCATCGCCC GCTTCAAGCA GGACAAGCCG GTGCTGTACT ACACCTGGAC GCCCTACTGG
GTCAGCGCGG TGCTGCGGCC CGGTGCGGAC GTGGTGTGGC TGCAGGTGCC CTTCTCGTCC
TCCCAGGGCG GCAACGCGGA CACGCAGCTT CCCAACGGCA AGAACTACGG CTTCCAGGCC
AACCAGGAGC AGATCGTCGC CAACCGGGCC TTCGTCGAGA AAAACCCGGC CGCCGGCCGG
CTGTTCGAGG TGATGAAACT GCCGATCGGC GACATCAACG CCCAGAACCT GCGCATGAGC
CAGGGCGCCA ACACGCAGCA AGACCTGGAG CGCCACACTG ACGGCTGGAT CCGGGCGCAC
CGGCCGCTGT TCGATGGCTG GATCGAGCAA GCCCGGGCCG CCGCAAGGTA G
 
Protein sequence
MKKRTRLILA RGLLALGFAA FGMGAQAAND LPGQGVTVQP LKSSLAEEAF QTLLVMRALE 
KLGYTVEPMK DLEPATEHLA IANGDATFMA NHWSLLHADF YKNSGGDAKL WRKGVYSDGA
VQGYLIDRKT AEQYNIRSIA QLKDPAIARL FDADGDGKAD LTGCNPGWGC ELAIENHLTA
YQLRDTVTHK QGSYAALMAD TIARFKQDKP VLYYTWTPYW VSAVLRPGAD VVWLQVPFSS
SQGGNADTQL PNGKNYGFQA NQEQIVANRA FVEKNPAAGR LFEVMKLPIG DINAQNLRMS
QGANTQQDLE RHTDGWIRAH RPLFDGWIEQ ARAAAR