Gene Vapar_4795 details

Gene Information

Locus tag: Vapar_4795
Symbol:
ID: 7970265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 5110167
End bp: 5111429
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 67%
IMG OID: 644795390
Product: Extracellular ligand-binding receptor
Protein accession: YP_002946666
Protein GI: 239817756
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
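
The coordinates and lengths listed above are consistent with one another, which the following minimal sketch checks. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5110167
end_bp = 5111429

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1263 420"
```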


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCCA AGCACGCGCC ACTGGCGCCG ATCGCCCTTG CCTGCCTGCT GGCCGCTGCC 
GGCGGCGCCC AGGCGCAGCA ACAGCAGCCC AGCTACAAGA TCGCCTACAT CGACCCGCTC
TCAGGGCCGT TCGCGAACGT GGGCGAGCTG ATGCTCACGC ACACGCAGTA CGCCATCGAG
GAGATCAACG CCAAGGGCGG TGTGCTCGGC GGCACGAAGC TGCAGTTGCT GCAGTTCGAC
AGCAAGCTCT CGGCGCAGGA GAGCCAGAGC GCGCTGCAGG CGGCCATCGA CCAGGGCGCC
CAGGCCATCG TCACGGGTGG GTCGGGCTCC TCGGTGGTGT CGGCACTGGT GCAGTCGGTG
GCGCGCTGGA ACCAGCGCAA TCCGGGCAAG GAGCTGATCG TGCTGAACCA TTCGTCGATC
GACCCCGAGA TGACCGGCAA GAACTGCAGC TTCTGGCACT TCCAGACCGA GGCCAACACG
GCGATGAAGA TGAAGGCGCT GGCCAACTAC ATCAAGAAGA CGCCGGACGT GAAGAAGGTC
TACCTGCTGA ACCAGGACTA CGCCCACGGC AAGCAGTGGG CGAGCCACGG GCGCCAGCTG
GTGGGCCTGG CGCGGCCCGA CGTGCAGTTC GTCGGCGAAA CGCTGCATCC GATCGGCCGC
GTAAAGGACT TCTCGCCCTA CATCGCCAAC ATCAAGCAGA GCGGCGCCGA CTCGGTCATC
ACCGGCAACT GGGGCCAGGA CATGACGCTG CTGCTCAAGG CCGCGGGCGA TGCGGGCTAC
AACCTGCGCT ACTTCAACCA CAGCGCGGGC TCGGTGCCGG GCACGGTGCT GGCGGTGTCG
CAGGCCAAAC TCGGGCAGCT GACCTGGGTG GCCGAGTGGC ATCCGGGGCA GGCCGACACG
CCGCGCGCCG ATGCGCTGGC CAAGGCGTAC AAGGCGAAGA CGGGCAAGGA CTTCCTCGCG
CCGCGCATCG ACTTCACGCC GCGCCTTCTG GCCGCCGCCA TCAACAAGGC CGGCTCGACC
GACACGGTGA AGGTGGCGCG TGCGCTCGAG GACATGAGCT ACGACTCGGT GGTGGGGCCG
ATCCGCATGC GCGCCGAGGA CCACCAGTTG CTGCTGCCGC AGGTGGTCAA CACGATCGCG
CCGGTCGATG GCAAGAGCGT GAAGGTGGGC TGGGAAGGGA CGAACTACGG CTTCCGGACG
GACGCTGTCT ACACGGGCAA CGAGCTGGCG CAGGGGTCTG AGTGCAAGAT GGTTCGGCCT
TGA
 
Protein sequence
MKPKHAPLAP IALACLLAAA GGAQAQQQQP SYKIAYIDPL SGPFANVGEL MLTHTQYAIE 
EINAKGGVLG GTKLQLLQFD SKLSAQESQS ALQAAIDQGA QAIVTGGSGS SVVSALVQSV
ARWNQRNPGK ELIVLNHSSI DPEMTGKNCS FWHFQTEANT AMKMKALANY IKKTPDVKKV
YLLNQDYAHG KQWASHGRQL VGLARPDVQF VGETLHPIGR VKDFSPYIAN IKQSGADSVI
TGNWGQDMTL LLKAAGDAGY NLRYFNHSAG SVPGTVLAVS QAKLGQLTWV AEWHPGQADT
PRADALAKAY KAKTGKDFLA PRIDFTPRLL AAAINKAGST DTVKVARALE DMSYDSVVGP
IRMRAEDHQL LLPQVVNTIA PVDGKSVKVG WEGTNYGFRT DAVYTGNELA QGSECKMVRP
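
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before further analysis. The sketch below shows one possible way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool, and the example input is only the first two blocks of the gene sequence.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used as a short example input.
example = join_blocks("ATGAAGCCCA AGCACGCGCC")
print(len(example), gc_fraction(example))
```

Pasting the full gene sequence into `join_blocks` should give a string whose length can be compared with the gene length in the table, and whose GC fraction can be compared with the listed GC content.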