Gene Vapar_5468 details

Gene Information

Locus tag: Vapar_5468
Symbol:
ID: 7975162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012792
Strand:
Start bp: 169202
End bp: 170809
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 72%
IMG OID: 644796055
Product: protein of unknown function DUF894 DitE
Protein accession: YP_002947329
Protein GI: 239820144
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
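
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 169202
END_BP = 170809
REPORTED_GENE_LENGTH = 1608     # bp
REPORTED_PROTEIN_LENGTH = 535   # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the reported 1608 bp and 535 aa agree with the coordinates.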


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.685954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGATC TCGCAGATGC CGCAGAGGCC GCCCAGGCCG CGAAGCCGAG CAGCAGCTTT 
GCGCCGCTGC GCCAGCCGGT CTTTGCCGTG CTGTGGGCGG CCACCGTGCT CGGCAACATC
GGCAGCTTCA TGCGCGACGT GGCGAGCTCC TGGCTCGTGA CCGACCTGTC GGCCAGCCCC
ACGGCGGTGG CGCTGATCCA GACCGCGGCC ACGCTGCCGA TCTTCCTGCT CGCGATTCCG
GCCGGGGTGC TGTCCGACAT CCTCGACCGG CGGCGCTTCC TGATCTTCGT GCAGCTGGTG
CTGGCGGGCG TGAGCGGCAC GCTGCTGGTG CTCTCGCACA CCGGCGCGCT CACCGTCGAG
TACCTGATCG CGCTGACCTT CGTCGGCGGC ATCGGCGCCG CGCTCATGGG GCCGACCTGG
CAGTCGATCG TGCCCGAGCT GGTGCCGCGC GCCGACCTCA AGAACGCGGT GGCGCTGAAC
TCGCTGGGCA TCAATATTGC GCGCTCCATC GGGCCGGCCG CGGGCGGCCT GATCCTCGCG
AGCTTCGGCG CCGCCCTGAC CTATGGCGCC GACGTGCTCA GCTATGTGTT CGTGATCGCG
GCGCTGCTGT GGTGGAAGCG CCCGGCGGCC GCCGACAGCG GCCTGTCGGA GAACTTCCTG
GGCGCCTTCC GCGCCGGCCT GCGCTACACG CGCGCCAGCC GCGAGCTGCA CCGTGTGCTG
CTGCGCGCGG CGGTGTTCTT CCTGTTCGCC AGTTCGGTGT GGGCGCTGCT GCCGCTGGTG
GCGCGACAGA TGCTGGGCGG CAGCGCCGGC TTCTACGGCA TCCTGCTGGG CGCCGTGGGT
GCGGGCGCCA TCGGCGGCGC GCTGGTGATG CCGCGGCTGC GCGCGCGCTT CAATGCCGAC
GGCATGCTGC TGCTGGCCTC GCTGCTCACC GCCGGCGTGA TGGGCAGCCT GGTGTTCGCG
CCGCCGCAGT GGCTCGCAGT GCTGTTGCTG CTGGTGCTGG GCCTGGGCTG GATCATCGCG
CTCACCACGC TCAACGGCGT GGCGCAGTCG ATCCTGCCGA ACTGGGTGCG CGGGCGCGGC
CTGGCCGTGT ACCTCACGGT GTTCAACGGC GCAATGGCGG CCGGCAGCCT GGGCTGGGGC
CTGGTGGCGC AGGAGATCGG CGTGCCGTAC ACGCTGGTGG CGGGCGCCGC CGGGCTGGTG
GTCGTGGCCC TGCTGTTCCA CCGTGCACGC CTGCCCACCG GCGATTCGGA CCTGCAGGCC
TCGAACCACT GGCCCGAGCC ACTGGTGGCC GAGCCTGTCG CGCACGACCG CGGTCCCGTG
ATGGTGCAGG TCGAGTACCG CATCCGCAAG GAAGACCGCC CGGCGTTCCT GGACGCGATG
AAGCGGCTGT CGCTCGAGCG CCGCCGCGAC GGCGCCTACG CATGGGGCGT GACCGAGCAC
ACCAGCGACC CCGAGCGCGT GATGGAGTGG TTCCTGGTCG AGTCCTGGGC CGAGCACCTG
CGCCAGCACC ACCGCGTGTC GCATGCCGAC GCCGACCTGC AGAACGAAGC CGTGCGCTTT
CACATCGGGC CCGGCCGGCC CGAGGTGCAC CACTTCCTGT CGCTCTGA
 
Protein sequence
MADLADAAEA AQAAKPSSSF APLRQPVFAV LWAATVLGNI GSFMRDVASS WLVTDLSASP 
TAVALIQTAA TLPIFLLAIP AGVLSDILDR RRFLIFVQLV LAGVSGTLLV LSHTGALTVE
YLIALTFVGG IGAALMGPTW QSIVPELVPR ADLKNAVALN SLGINIARSI GPAAGGLILA
SFGAALTYGA DVLSYVFVIA ALLWWKRPAA ADSGLSENFL GAFRAGLRYT RASRELHRVL
LRAAVFFLFA SSVWALLPLV ARQMLGGSAG FYGILLGAVG AGAIGGALVM PRLRARFNAD
GMLLLASLLT AGVMGSLVFA PPQWLAVLLL LVLGLGWIIA LTTLNGVAQS ILPNWVRGRG
LAVYLTVFNG AMAAGSLGWG LVAQEIGVPY TLVAGAAGLV VVALLFHRAR LPTGDSDLQA
SNHWPEPLVA EPVAHDRGPV MVQVEYRIRK EDRPAFLDAM KRLSLERRRD GAYAWGVTEH
TSDPERVMEW FLVESWAEHL RQHHRVSHAD ADLQNEAVRF HIGPGRPEVH HFLSL
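
The sequences above are laid out in blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate a CDS. It assumes that translation table 11 uses the same sense and stop codon assignments as the standard genetic code; the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. Only the first line of the gene sequence is used as an excerpt here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard codon assignments,
# assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, used as a short excerpt.
excerpt = clean_sequence(
    "ATGGCTGATC TCGCAGATGC CGCAGAGGCC GCCCAGGCCG CGAAGCCGAG CAGCAGCTTT"
)
excerpt_protein = translate(excerpt)
excerpt_gc = gc_fraction(excerpt)
print(excerpt_protein)
print(round(excerpt_gc * 100, 1))
```

Passing the full gene sequence through `clean_sequence` and `translate` should allow the result to be compared with the protein sequence listed above, and `gc_fraction` with the reported GC content.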