Gene Vapar_4704 details

Gene Information

Locus tag: Vapar_4704
Symbol:
ID: 7971714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 5000528
End bp: 5001643
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 67%
IMG OID: 644795289
Product: Extracellular ligand-binding receptor
Protein accession: YP_002946575
Protein GI: 239817665
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
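
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Start bp and End bp as given in the record.
length = gene_length_bp(5000528, 5001643)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Both values agree with the Gene Length and Protein Length fields of the record.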


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAGA TGTTGTCCAC GTCCCGCGCT GCTGCGCTGC TCTTCGCCCT GTCCGCGCTC 
GGCGCGTCGG CGCAGATCGT GATCGGCCAG TCGGCCGACC TGTCGGGCCC GGTGGCCGCC
AGCGTGAAGG AAACCATCAT GGGCTCGCAG CTGGTCATCG ACCAGGTCAA CGCCCAGGGC
GGGATCAACG GCGAGCAGGT GGAGGTGATC CGGCTCGACG ACGGCCTGGA CGCGAAGCGC
TCGCTGGAGA ACACGCGCAT CCTGATCGAG GACAAGAAGG TGCTGGCGCT GCTGCTCAAC
CGCGGCACGC CCAACACCCT GGCCGTGATC CCGCTGCTCG ACAAGTACGG CGTGGCGCTG
GTGGGCCCTT CCACCGGCGC GATGGCGCTG CACAAGCCGC TGCAGAAGAA CATCTTCAAC
GTGCGCTCGA CCTACCAGCG CGAGGCTGAA AAAGCGGTGC AGCACCTGCA CACCACGGGC
ATCCAGCGCA TTGCCGTGGT GCAGGCCGAC GACTCCTTCG GCAAGGACGC GATGGAAGGC
GCGAGCAAGG GCTTCGAGAA GGCCGGGCTC GCACCGGCGG TGCTGGCGCT GGCCGACCGC
AGCAAGCCCG ACTATTCGGC GATCGTGCCG CAGCTGGTCA AGGCCAATGC GCAGGCGGTG
CTGTGGATCG GCTCGGGCAC CGCGGTGACC GAAGGCGTGA AGGCGCTGCG GGCCGCGGGT
TCGGCGGCGC AGATCATCAC GCTGTCGAAC AATGCGGCTT CGGGCTTCAT CAAGGAGCTG
GGCTCGGCCA GCGCCGGCGT GATCGTCACG CAGGTGCTGC CCTACGAGCG CTCGTTCGGC
CATCCGCTGA TCAAGGAGGC GATGGCGCTC GCCAAGGCCA AGGGGCAGAC CGAGCTGTCG
CCCGCGCTGC TCGAAGGCTT CGTCGCCACC AAGGTGATGG TGGAGGCGCT GCGCCGCACG
GGCCCCAAGC CCACGCGCGC CCGGCTGATT GCCACGCTCA ACAGCTTCCA GTACGACCTG
GGCGGCAACA TCGACGTGAG CTATTCGCCG ACGGACCACA CGGGCATCGA CTACGTCGAC
CTGTCGATCA TCAGCGAAGG CCGCTTCAAG CGCTGA
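
The gene sequence is listed in blocks of ten bases, so the spacing has to be removed before the sequence is measured. The sketch below, in Python, shows one way to clean such a listing and compute its GC fraction, the quantity reported as GC content in the gene information. The helper names are hypothetical, and the example uses only the first line of the listing rather than the whole gene.

```python
def clean_sequence(listing: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First line of the gene sequence listing, used here as a small example.
first_line = "ATGATGAAGA TGTTGTCCAC GTCCCGCGCT GCTGCGCTGC TCTTCGCCCT GTCCGCGCTC"
seq = clean_sequence(first_line)
print(len(seq))                        # 60
print(round(gc_fraction(seq) * 100))   # GC percentage of this line only
```

Applying the same two functions to the full listing gives the length and GC percentage of the whole gene.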
 
Protein sequence
MMKMLSTSRA AALLFALSAL GASAQIVIGQ SADLSGPVAA SVKETIMGSQ LVIDQVNAQG 
GINGEQVEVI RLDDGLDAKR SLENTRILIE DKKVLALLLN RGTPNTLAVI PLLDKYGVAL
VGPSTGAMAL HKPLQKNIFN VRSTYQREAE KAVQHLHTTG IQRIAVVQAD DSFGKDAMEG
ASKGFEKAGL APAVLALADR SKPDYSAIVP QLVKANAQAV LWIGSGTAVT EGVKALRAAG
SAAQIITLSN NAASGFIKEL GSASAGVIVT QVLPYERSFG HPLIKEAMAL AKAKGQTELS
PALLEGFVAT KVMVEALRRT GPKPTRARLI ATLNSFQYDL GGNIDVSYSP TDHTGIDYVD
LSIISEGRFK R
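
The protein sequence corresponds to the gene sequence read codon by codon. The following Python sketch illustrates that relationship. It assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code and differs only in the start codons it permits, which this sketch does not handle. The table construction and the `translate` helper are illustrative, not taken from any particular library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove block spacing and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks and the last line of the gene sequence listing.
print(translate("ATGATGAAGA TGTTGTCCAC GTCCCGCGCT"))          # MMKMLSTSRA
print(translate("CTGTCGATCA TCAGCGAAGG CCGCTTCAAG CGCTGA"))   # LSIISEGRFKR
```

The two outputs match the first block and the final residues of the protein sequence listing.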