Gene Vapar_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3034 
Symbol 
ID7973754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp3192275 
End bp3194500 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content69% 
IMG OID644793618 
ProductTonB-dependent receptor 
Protein accessionYP_002944919 
Protein GI239816009 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0831829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGCC AGGCGCAGGA GGCCGCGCCG CAGGCGGGCA CGCTGCCGGC CGTCGAGGTG 
GTGGCCACCA CGCCCGTGCC GGGCATCGAG GTGCCGAAGG ACCAGATCCC GTCCAACGTG
CAGACCGCCG ACGACCGGCA CCTGCGCCGC GCGCAGAGCC TGAACCTGCC GGACTTCATG
GCCACGCAGC TGCCCAGCGT GAACGTGAAC GAGATCCAGG GCAATCCGTT CCAGGTCGAC
GTGAACTACC GCGGCTTCAG CGCGAGCCCG GTGCTCGGAA CGCCCCAGGG CCTGTCGGTG
TACCAGGACG GCGTGCGCAT CAACGAGCCC TTCGGCGACG TGGTGAACTG GGACCTCATC
CCGAAGGCCG CCATCTCGAG CATCACGCTG CTGCCGGGCT CCAACCCGCT GTTCGGCCTC
AACACGCTCG GCGGTGCGCT GTCGCTGCAG ACCAAGCGCG GCGACACGCA TCCGGGCACC
GAGCTGGAAC TGCAGGCCGG CTCCTTCGGC CGCGTGAGCA CCGAGCTCAC GCACGGCCGC
AAGCTGGCCG AAGGCGGGCA CCTGTTCCTT GCGCTCGGCG GCCTCAACGA GGACGGCTGG
CGCAACTACT CGCCTTCGCG CGTGCGCCAG CTGTTTGCGA AGGTGGGGCA GGACAGCGGA
AAGCTCTCCT GGGACCTGAG CTTCACCCAT GGCGACAACC GGATGATCGG CAACGGCCTG
CTGCCCGAGT CGATGCTGAT GCAGAACCGC AAGCAGGTGT ACACGCGGCC CGACCGCACC
GAAAACCGCA TGTCGATGCT CACGCTCAAT GCCAGCTACC GCCTGAGCGA CGTGCAGACG
ATCTCCATGA CGGCTTACAC GCGGCGCTCG CGCTACAGCA CGCTCAACGG CGACCTCAAC
GACGGTTTCA ATCCGCCGGA CAACGAAGCC ACGGGCGTGG AGAACCGCAC CTACACGCGC
CAGCGCAGCG AAGGCGTGGC GCTGCAGTCG ACCTATACCG CGGGCATTCA CCAGCTCACT
TTCGGCGCCT CCGTGGACCG TGCGCGCACG CACTTCCGCC AGACCGAGGC CGAGGGCATG
CTCGACTCCA CGCGCGCGGT GGTGCCGCAG GAAGAAGCCG AAGTCGATGC GCTGCTCGCG
GGCAAGAGCC GCACCGCGAG CATCTATTTT TCCGACCTGG TGAGCCTGCA GCCGAACCTG
CAGCTGAGCC TTTCGGGCCG CTACAACGAC ACTCGAGTGA GCACCCGCGA CGACGGGCGC
GCCTTGCTCG GGCTGTCCAC CCGGCTCGAT GGCGAAGGCC ATTACAAGAA GTTCAATCCT
GCCATCGGCC TCACCTGGCA GGCTACGCCG CGGCTCACGG CCTACGCGGG CTGGAGCCAG
GGCAGCCGCG CGCCGAGCCC GATCGAGCTC GGCTGCTCCG ATCCGGCCAA CGCCTGCGTG
CTGCCCAATG CGCTGCAGTC CGATCCGCCG CTCAAGCAGG TGGTGTCGCA GACCTTCGAA
ACCGGGCTGC GCGGCACGCT CGAGCCAGGC ATGCGGTGGA ATGCCTCGGT GTTCCGCACC
GTCAACAAGG ACGACCTGCT GTTCGTGAGC AGCGGGCTTT CGCGCGGCTA CTTCAGCAAC
TTCGGGCGCA CCCTGCGCCA GGGCGTGGAG CTCGGGCTCT CGCAGCAGAC CGAACGCGTC
GACTGGTCGC TGTCGTACAG CTACCTGCGC GCGAGGTACG ACTCGCCGGC CTGCCTGGTG
GCCGAAGCCA ACAGCAGCGC CGAGACCAGC CCCGCCTGCA CCGGCGAGGG CGAGATCGCG
GTGCGCCGCG GCGACCGCCT GCCGGGACTG CCGGCGCATT CGCTCAAGCT CAATGTCGAC
TGGCGCGTGA CGCCCGAGTG GACGCTGGGC GCGCAGTACC GCGTGTATTC GAAGCAGACG
GTGCGCGGCA ACGAGAACGG CCTGCACGCG CCCGACGGGG CCGACTTCAG CGGCAGCGGC
CGCATCGGCG GCTACGCGCT GCTCGACCTC ACGACGCGCT GGAAGCTCGG GCCCAACGTG
GAGCTTTTTG CCAAGGTGGC GAACGTGTTC AACCGGCGCT ACGCCACCGC CGGCCAGCTG
GGCCGCAGCG GCTTCGATGC GAGCGGCGCG GTGCTGGCGC CTGATGCATG GCGCAACGTG
CAGTTCGTGG CTCCCGGCGC GCCGCGCGCG GTGTGGATCG GCATGCGGGT GCAACTGGGC
GTCTGA
 
Protein sequence
MPGQAQEAAP QAGTLPAVEV VATTPVPGIE VPKDQIPSNV QTADDRHLRR AQSLNLPDFM 
ATQLPSVNVN EIQGNPFQVD VNYRGFSASP VLGTPQGLSV YQDGVRINEP FGDVVNWDLI
PKAAISSITL LPGSNPLFGL NTLGGALSLQ TKRGDTHPGT ELELQAGSFG RVSTELTHGR
KLAEGGHLFL ALGGLNEDGW RNYSPSRVRQ LFAKVGQDSG KLSWDLSFTH GDNRMIGNGL
LPESMLMQNR KQVYTRPDRT ENRMSMLTLN ASYRLSDVQT ISMTAYTRRS RYSTLNGDLN
DGFNPPDNEA TGVENRTYTR QRSEGVALQS TYTAGIHQLT FGASVDRART HFRQTEAEGM
LDSTRAVVPQ EEAEVDALLA GKSRTASIYF SDLVSLQPNL QLSLSGRYND TRVSTRDDGR
ALLGLSTRLD GEGHYKKFNP AIGLTWQATP RLTAYAGWSQ GSRAPSPIEL GCSDPANACV
LPNALQSDPP LKQVVSQTFE TGLRGTLEPG MRWNASVFRT VNKDDLLFVS SGLSRGYFSN
FGRTLRQGVE LGLSQQTERV DWSLSYSYLR ARYDSPACLV AEANSSAETS PACTGEGEIA
VRRGDRLPGL PAHSLKLNVD WRVTPEWTLG AQYRVYSKQT VRGNENGLHA PDGADFSGSG
RIGGYALLDL TTRWKLGPNV ELFAKVANVF NRRYATAGQL GRSGFDASGA VLAPDAWRNV
QFVAPGAPRA VWIGMRVQLG V