Gene Dole_3157 details


Gene Information

Locus tag: Dole_3157
Symbol:
ID: 5696019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3783820
End bp: 3784869
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 60%
IMG OID: 641265776
Product: TRAP dicarboxylate transporter- DctP subunit
Protein accession: YP_001531037
Protein GI: 158523167
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID:
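
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1050 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 3783820
end_bp = 3784869

# Inclusive interval length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1050 349
```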


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00466763
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAAA AGCAGGGCAT AATCAAAAAG GCGGTCTGGG TGCTGGCGGC TGTTGTCATG 
GCCGTCTGGG CGGCAGGACC TGTGACCATT GTCCGTGCCG ACACAGCGCC GAAGGTTGTC
TGGAAGGTGG GGACACTCAC GCCCAAGGGC GTGGGCTGGG CCCACCAGTT TGAAACCATC
ATGATGCCGG TTATTCAATC CGGCACCAGC GGCGAGCTGA AGGTGAAGGT GTACTGGGGC
GGCCTGATGG GAGACGATGA GGATATCGTG GCCAAGATGC GGGTGGGCCA GCTTCAGGCC
GCCGGCCTCA CCGGTCAGGG CGCCACCATT GCCTGCCCCG AGTTTGCCGT GGTGGAGCTG
CCCTTTCTTT TTAAGAGCTA TGCCGAGGTG GACCACATTC GGGAAAAGAT GTGGCCTGAA
TTCGACCGCC TGATGCAGGC CCGGGGCTTC AAGCTGCTGG CGTGGCTGGA TCAGGATTTT
GACCAGATAT ACTCGGTGAA GTGGAGTTTT ACGGATCTTG CCGATTTTCA GAAGGCCCGG
TTCATGACCT GGTACGGCAC TCTGGAAGAG CACCTGCTCA AGAGCCTCAA TGCCAGCCCC
ATTCCCGTGA ACATTCCCGA GCTGGCGCCC TCCCTGCGCC AGGGCGTGGC CGACTCCCTG
ATCGCGCCGG CCCTCTGGAT GATCGCCACT CAGCTCTACC CGGTGGTCAA CTACATGGTG
CCGTTAAAGA TCCGTTACTC CCCGGCAGTG GTTGTCTGTA CCCTGGATGC ATGGAACGGC
CTGTCGGCGT CGTCCCGGGC CGGCCTTGCC GCGGCCCGGC CGGAGATGGA AAAACAGTTT
GTGGCCGCCT CCCGTAAGGA CAATCAAAAG GCTATGGACG CCATGGTCAA ATACGGCATT
GTGCGGGTGG ACATGACCGA CGCCCAGGTG GAGACCATTC GAAAAGGGGC CGTGACCGTG
TGGGACGATC AGGCCGATAA ACTTTATTCC AGGGAACTGC TTGACCGGAT ACTGGTCCAT
CTGGACCAGT ACAGGAGCCA AACCCCGTGA
 
Protein sequence
MMKKQGIIKK AVWVLAAVVM AVWAAGPVTI VRADTAPKVV WKVGTLTPKG VGWAHQFETI 
MMPVIQSGTS GELKVKVYWG GLMGDDEDIV AKMRVGQLQA AGLTGQGATI ACPEFAVVEL
PFLFKSYAEV DHIREKMWPE FDRLMQARGF KLLAWLDQDF DQIYSVKWSF TDLADFQKAR
FMTWYGTLEE HLLKSLNASP IPVNIPELAP SLRQGVADSL IAPALWMIAT QLYPVVNYMV
PLKIRYSPAV VVCTLDAWNG LSASSRAGLA AARPEMEKQF VAASRKDNQK AMDAMVKYGI
VRVDMTDAQV ETIRKGAVTV WDDQADKLYS RELLDRILVH LDQYRSQTP
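
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch strips the 10-base block spacing, computes a GC fraction, and translates codons up to the first stop. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs mainly in allowing alternative start codons, and no special handling is included here because this gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGATGAAAA AGCAGGGCAT AATCAAAAAG"
print(translate(first_blocks))  # MMKKQGIIKK
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values to compare against the protein sequence and GC content fields of the record.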