Gene Dole_3001 details


Gene Information

Locus tag: Dole_3001
Symbol:
ID: 5695860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3602056
End bp: 3603354
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 61%
IMG OID: 641265617
Product: major facilitator transporter
Protein accession: YP_001530881
Protein GI: 158523011
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
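
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself.

```python
# Coordinates copied from the record above.
start_bp = 3602056
end_bp = 3603354

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1299
print(protein_length)  # 432
```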


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCAT CCCTGGTTGC CAAAATGCTG CGGTACCGCT GGTGGATTTT CCTGATCCTG 
GGGGTGGCCT ACCTGCTGGT CTATTTCCAC CGCTTGTCTT TGTCCGTGGT GGCCGACGAC
CTTATTTTCG AGTTTGCCAC ATCCGCCGGG GTGATGGGCC TGCTCTCCTC CATCTATTTT
TACTGCTACG CGGTGATGCA GCTGCCGGCG GGCCTGCTGT CCGACTCCAT CGGGCCCAGG
CGCACGGTCA GCGCCTTTCT GCTGGTGGCC GCGGCCGGCA GCATTATGTT CGGCATGGCC
CCCACCATCG AGGTGGCCTT TTTTTCCCGG GTCCTGGTGG GGTTCGGCGT CTCCATGGTG
TTTATTCCCA CCATGAAAAT CCTGGCCCAG TGGTTTCGAA AAGACGAGTT TGCCTCCATG
GCCGGGCTGT TCAATGCCGT GGGCGGCATG GGGGTGCTGG CCGGCACCTG GCTGCTGGGA
TACATGGCGC AGTCCATGGG CTGGCGAATC TCTTTTGTGC TGATCGGCGC GGGCACCCTG
GTGATCGTGG TGCTGGCCTG GCTGGTGGTG CGCGACCGGC CCCAGGACAA GGGATGGCCC
TCCATCGCGG ACATTGAAAA CCAACAGGCA ACCCCGCCCC CTGCCGCCAT TCCCCTGCTG
GCGGGCCTGG GCCGGGTGCT GTCGGAAAAA TCCTTCTGGC CGGTGGCGGT GTGGTTCTTT
TTTGACTGCG GCCTTTTTTT CGGGTTCGGC GGCCTGTGGG CAGGGCCCTA CCTGATGCAT
GTCTACGGTC TGTCCAGGGC ACAGGCCGGC GGCGTGCTCT CCATGATCGC CTGGGGCATG
ATCATCGGCA GCCCGCTGAT GGGGTTTTTC TCCGACCGGG TGGTCAAAAG CCGTAAAAAA
CCCTTTATCA TCTGTGGCGT GGTGCTGTGC GCCGAGATGC TGTTTCTCTA CTTAAACCCC
GACGGGCTTT CCCTGGCCGC GCTTTACGGG GTGTTCTTTG TCTTTTCCAT CTGCGCGTCA
TCCATCGTAA TCGTTGGGTT TACCACCACC AAGGAGCTGT TTCCGGTCTC CATGGCGGGT
ACCTCGGTGG GCGCGGTCAA CCTTTTTCCC TTTCTGGGCG GCGCAATCTA CATGCCGCTG
CTGGGCCGGG TGCTGGACAG TGTGCCCCAG CCCACGCCCG GCTCATATGC CCTGGAGGGC
TATACCCTTA TGCTGCTGGT GCTGCTGGCA TCCGCGGTGG CGGCCCTGTG CTGCACCTTT
TTCATGAAGG AGACCTTCAC AAAGCAGGCC CACGGCTGA
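
The GC content reported for this gene could be recomputed from the sequence above with a sketch like the following. The function name `gc_content` is hypothetical, and the example input is only the first two 10-base blocks of the gene, so its result is not expected to match the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used only as a small example.
print(gc_content("ATGAATGCAT CCCTGGTTGC"))  # 50.0
```

Passing the full gene sequence (spaces and line breaks are ignored) would give a value to compare against the GC content field in the record.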
 
Protein sequence
MNASLVAKML RYRWWIFLIL GVAYLLVYFH RLSLSVVADD LIFEFATSAG VMGLLSSIYF 
YCYAVMQLPA GLLSDSIGPR RTVSAFLLVA AAGSIMFGMA PTIEVAFFSR VLVGFGVSMV
FIPTMKILAQ WFRKDEFASM AGLFNAVGGM GVLAGTWLLG YMAQSMGWRI SFVLIGAGTL
VIVVLAWLVV RDRPQDKGWP SIADIENQQA TPPPAAIPLL AGLGRVLSEK SFWPVAVWFF
FDCGLFFGFG GLWAGPYLMH VYGLSRAQAG GVLSMIAWGM IIGSPLMGFF SDRVVKSRKK
PFIICGVVLC AEMLFLYLNP DGLSLAALYG VFFVFSICAS SIVIVGFTTT KELFPVSMAG
TSVGAVNLFP FLGGAIYMPL LGRVLDSVPQ PTPGSYALEG YTLMLLVLLA SAVAALCCTF
FMKETFTKQA HG
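
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAATGCAT CCCTGGTTGC CAAAATGCTG"))  # MNASLVAKML
```

The output matches the first ten residues of the protein sequence listed above.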