Gene Dole_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0944 
Symbol 
ID5693779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1102025 
End bp1103041 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID641263541 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001528831 
Protein GI158520961 
COG category[R] General function prediction only 
COG ID[COG4174] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000124721 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCAT ATTTTATTCG ACGATTCCTG CTGATCATTC CCACCTTTCT GGGCATCACC 
GTACTGGTGT TCGCGGTCAC GCGGTTTGTG CCCGGCGGCC CGGTGGAACG CATGATTGCC
GAGTCCTACC GCATGCAGGC CATGGAGGGC CGCACCCAGC GGGAGGCCAC CCAGCCCCTC
TCCGAAGAGC AGATCAATTA CCTGAAACGC TATTACGGGT TCGACAAACC GGTGCCGGCA
GCCTATGTGC TCTGGATGGG AAAAGTTCTG TCCGGCGACC TGGGCACCTC CACCCGGTAT
TATGACCCGG TATGGGAGAT GATCCGGTCC CGCATACCCA TATCGCTCTA TTTCGGCCTG
CTCTCTATGG TCATTATATA CGGGGTCTGC ATTCCCCTTG GCATGGCCAA GGCCGTGCGC
CACAAAAGCG GGTTCGACAA CTTCACCTCC GTAGCGGTGT TTGCCGGCTA CGCCGTGCCC
GGCTGGGTCC TGGGCATTCT GCTGCTGCTG CTCTTTTCTT CCCGGTGGGG GGTGCTGCCC
CTGGGCGGGC TCACAAGCGC CGGCTTTGAC GCCCTGTCCG GACCTGAAAA AATTCTCGAT
ATCGCCCGGC ACACGGTCCT GCCCCTGGCC GCCTACGTTG TGGGCTCCTT TGCCGTAATG
ACCTTTTTAA TGAAAAACAC CCTGATGGAC GAACTGGCCG CCGACTATGT GCGCACGGCC
ATGGCAAAAG GGCTGTCATT TAAAAAAGCG GTGTTCGGCC ATGCCTTAAG AAACAGCCTG
ATTCCCGTTG CCACCAGCTT CGGCAACAAC ATATCGGTCC TGGTCTCGGG CTCGTTTCTC
ATTGAAACGG TCTTCAACAT CAACGGCATG GGCCTTTTGG GCTACGAGTC GGTGGTGGAG
CGGGACTATC CCGTGGTCAT GGGCATTCTG GTGATCTCGT CGCTGCTGTT TTTAATCGGC
AACATTCTTT CCGATATCTG CGTGGCCTTT GTGGACCCGC GGGTGAGATT CCAGTAA
 
Protein sequence
MRAYFIRRFL LIIPTFLGIT VLVFAVTRFV PGGPVERMIA ESYRMQAMEG RTQREATQPL 
SEEQINYLKR YYGFDKPVPA AYVLWMGKVL SGDLGTSTRY YDPVWEMIRS RIPISLYFGL
LSMVIIYGVC IPLGMAKAVR HKSGFDNFTS VAVFAGYAVP GWVLGILLLL LFSSRWGVLP
LGGLTSAGFD ALSGPEKILD IARHTVLPLA AYVVGSFAVM TFLMKNTLMD ELAADYVRTA
MAKGLSFKKA VFGHALRNSL IPVATSFGNN ISVLVSGSFL IETVFNINGM GLLGYESVVE
RDYPVVMGIL VISSLLFLIG NILSDICVAF VDPRVRFQ