Gene Vapar_5209 details

Gene Information

Locus tag: Vapar_5209
Symbol:
ID: 7969868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 5531904
End bp: 5532959
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 68%
IMG OID: 644795803
Product: Xylose isomerase domain protein TIM barrel
Protein accession: YP_002947077
Protein GI: 239818167
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1082] Sugar phosphate isomerases/epimerases
TIGRFAM ID:
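
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon (the gene sequence in this record does end in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5531904, 5532959)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the 1056 bp and 351 aa listed in the record.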


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.625607
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGA TCAAGGGCCC GGCCATATTC CTGGCCCAGT TCGCCGGGGA CGAAGCCCCG 
TTCAATTCGC TCGACGCCAT TGCCGGCTGG GCCGCCGGGC TGGGCTACAA GGGCGTGCAG
ATCCCGAGCT GGGACGCGCG CCTGTTCGAC CTGAAGAAGG CCGCCGAGAG CAGGACCTAC
TGCGACGAGG TCAAGGGCAC GCTGGCCGCG CACGGCCTGG AGATCACCGA GCTGTCGACC
CACCTCCAGG GCCAGCTGGT CGCGGTGCAC CCGGCCTACG ACGCGGGCTT CGACGGCTTC
GCGGCGCCCG AGGTGCGCGG CGATCCGGTG CGGCGCCAGC AGTGGGCGGT GGAGCAGCTG
CACTTCGCCG CGAAGGCCTC GGCCAACCTG GGGCTCACGG CGCATGCCAC CTTCTCGGGC
GCGCTCGCCT GGCCGTACCT CTATTCGTGG CCGCCGCGCC CGCCGGGGCT CATCGAGGAG
GCCTTCGACG AGCTTGCGCG CCGCTGGCGC CCGATCCTCG ATGCCTTCGA CGCGGCGGGC
GTGGACGTGG GCTACGAGAT CCACCCGGGC GAAGACCTGC ACGACGGCGT GAGCTACGAG
ATGTTCCTGG AGCGCGTGAA CAACCACCCG CGCGCCTGCC TCCTGTACGA CCCGAGCCAC
TTCATGCTGC AGCAGCTCGA CTACCTGGCC TACATCGACC ACTACCACGA GCGCATCAAG
ATCTTCCACG TGAAGGATGC CGAGTTCAAC CCGACCGGCA AGCAGGGCGT CTACGGCGGC
TTCCAGAGCT GGATCAACCG CGCGGGGCGC TTCCGCTCGC TCGGCGACGG GCAGGTCGAT
TTCAGCGCCA TCTTCTCGAA GATGGCGCAG TACGACTTCC CGGGCTGGGC GGTGCTCGAA
TGGGAGTGCT GCATCAAGCA CCCCGAGGAC GGCGCGCGCG AGGGCGCGGC CTTCATTGCC
GACCACATCA TCCGGGTGGC GGAGCGGGCT TTTGACGATT TTGCGGCGGG CGGCGTGGAC
GTGGCCGCCA ACAAGCGGAT GCTGGGGATC GGCTGA
 
Protein sequence
MKMIKGPAIF LAQFAGDEAP FNSLDAIAGW AAGLGYKGVQ IPSWDARLFD LKKAAESRTY 
CDEVKGTLAA HGLEITELST HLQGQLVAVH PAYDAGFDGF AAPEVRGDPV RRQQWAVEQL
HFAAKASANL GLTAHATFSG ALAWPYLYSW PPRPPGLIEE AFDELARRWR PILDAFDAAG
VDVGYEIHPG EDLHDGVSYE MFLERVNNHP RACLLYDPSH FMLQQLDYLA YIDHYHERIK
IFHVKDAEFN PTGKQGVYGG FQSWINRAGR FRSLGDGQVD FSAIFSKMAQ YDFPGWAVLE
WECCIKHPED GAREGAAFIA DHIIRVAERA FDDFAAGGVD VAANKRMLGI G
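
The sequences above are printed in blocks of ten characters, so any check of the GC content or of the translation has to strip the whitespace first. The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons), and it stops at the first stop codon. The names `clean`, `gc_content` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 24 bases of the gene sequence, copied with their original spacing.
print(translate("ATGAAAATGA TCAAGGGCCC GGCC"))
```

Passing the full gene sequence to `gc_content` and `translate` would allow a comparison against the GC content field and the protein sequence listed in this record.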