Gene Vapar_0399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_0399 
Symbol 
ID7973543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp431143 
End bp432783 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content63% 
IMG OID644791002 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002942328 
Protein GI239813418 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGA CCACCCAGCC ATCCGTTTCG CGCCGCTCGC CCAGGCTTGG CGCGCTGCTC 
GCGCCGGCGG CACTCGCCAT GCTCGCACTG GGCGCGGGCG CCGTGTCCGC CAAGACGCTG
GTCTATTGCT CGGAGGGCAG TCCCGAGAAC TTCTATCCGG GCGTCAACAC CACCGGCACC
TCGTTCGACG TGACCACGCA GGTCTACAAC ACCATTGTCG AGTTCGAGCG CGGCGGCACC
AAGGTCGTGC CGGGCCTGGC TGAAAAATGG GACATCTCCG CCGATGGCAC GGTCTACACC
TTCCACCTGC GCAAGGGCGT CAAGTGGCAC AGCACCAGCA AGAGCTTCAA GCCCACGCGC
GACTTCAACG CCGACGACTT CATCTTCATG CTCGAGCGGC AGTGGAAGGA GAGCGATCCC
TTCTTCAAGG TCACGAGCCA GAACCACTCC TACTTCAACG ACATGGGCAT GCCCAAGCTC
CTGAAGTCGG TGGACCGCAT CGACGACCTG ACCGTGAAGA TCACGCTCAA CCAGGCCGAG
GCGCCGTTCC TTGCCAACCT GGCCATGCAG TACGCGGGCA TCCAGTCGAA GGAATACGCC
ATTGCGATGC TGAAGGCCGG CACGCCCGAG AAGGTCGACC AGGACCCGAT CGGCACCGGC
CCGTTCTACC TCGTGCAATA CCAGAAGGAC GCGGTCATCC GCTTCAAGGC CTTCCCGCAG
TACTGGGGCG GCAAGGCGAA GATCGACGAC CTCGTGTTCG CGATCACGCC CGATGCCTCG
GTGCGCTGGG CCAAGCTGCA GAAGGGCGAA TGCCACGTCA TGCCGTATCC GAATCCGGCC
GATCTCGACG CGATCCGCAA GGACCCGAAC GTGCAGGTGC TCGAGCAGCC TGGCCTCAAC
GTGGGCTACC TTTCGTACAA CACCACCAAG AAGCCCTTCG ACGACGTGCG CGTGCGCAAG
GCCATCAACA TGGCGATCAA CAAGAAGGCG ATCATCGACG GCGTGTACCT GTCGACCGGC
GTGGCCGCGA AGAACCCGAT CCCGCCCACC ATGTGGTCCT ACAACGACGC GGTCAAGGAC
GATCCCTACG ACCCCGAAGC CGCCAAGAAG CTGCTGGCGC AGGCCGGCTT TCCCGATGGC
TTCTCGACCG ACCTGTGGGC CATGCCGGTG CAGCGGCCCT ACAACCCGAA TGCCAAGCGC
ATCGCCGAGC TGATGCAGGC CGACCTTGCC AAGATCAACG TCAAGGCCGA GATCAAGAGC
TTCGAGTGGG GCGAGTACCG CAAGCGCCTG CAGGCCGGCG AGCACCAGAT GGGCATGCTC
GGCTGGACCG GCGACAACGG CGACCCCGAC AACTTCCTCT ACACGCTGCT GGGCTGCGCC
TCGGCCAAGT CGGCCAGCGG CAGCAACATC TCCAAATTCT GCTACCAACC CTACGAAGAC
CTCGTGCTCA AGGCCAAGAG CGCGACCAAG CAGGCCGAGC GCGATGCGCT CTACAAGAAG
GCGCAAGTCA TCTTCAAGGA GCAGGCGCCG TGGTTCACCA TCGCGCACGC GGTGCAGCTG
AAGCCGGTGC GCAAGGAGGT GGTCGACTTC AAGCTCAGCC CCTTCGGCCG CCACACCTTC
TACGGCGTGG ACATCAAGTA G
 
Protein sequence
MKKTTQPSVS RRSPRLGALL APAALAMLAL GAGAVSAKTL VYCSEGSPEN FYPGVNTTGT 
SFDVTTQVYN TIVEFERGGT KVVPGLAEKW DISADGTVYT FHLRKGVKWH STSKSFKPTR
DFNADDFIFM LERQWKESDP FFKVTSQNHS YFNDMGMPKL LKSVDRIDDL TVKITLNQAE
APFLANLAMQ YAGIQSKEYA IAMLKAGTPE KVDQDPIGTG PFYLVQYQKD AVIRFKAFPQ
YWGGKAKIDD LVFAITPDAS VRWAKLQKGE CHVMPYPNPA DLDAIRKDPN VQVLEQPGLN
VGYLSYNTTK KPFDDVRVRK AINMAINKKA IIDGVYLSTG VAAKNPIPPT MWSYNDAVKD
DPYDPEAAKK LLAQAGFPDG FSTDLWAMPV QRPYNPNAKR IAELMQADLA KINVKAEIKS
FEWGEYRKRL QAGEHQMGML GWTGDNGDPD NFLYTLLGCA SAKSASGSNI SKFCYQPYED
LVLKAKSATK QAERDALYKK AQVIFKEQAP WFTIAHAVQL KPVRKEVVDF KLSPFGRHTF
YGVDIK