Gene Dole_2765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2765 
Symbol 
ID5695622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3333076 
End bp3334275 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content60% 
IMG OID641265379 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_001530645 
Protein GI158522775 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0077077 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAA AAATTGTAAT TGAAAATCTT TACAAGATTT TCGGGCCCAA CCCGGAGGCG 
GCAATGAAGC TGCTTGCCCA GGGCATGGGC AAGGACGAGA TCATGGAAAA GACCCGGCAC
GGGGTGGGGG TGGCCGATGC TTCTTTTACC GTGAAAGAGG GGGAGATCCT GGTGGTCATG
GGGTTGTCCG GCAGCGGCAA GTCCACCCTG GTGCGGTGCA TCAACCGGCT GATCGATCCC
ACTGCCGGAA AGGTGATCGT CGACGGCCAG GATGTGACAA AGCTGGACAA GGAGCAGCTG
CGGCGTTTCC GGTTGCAGCA TTTCGGCATG GTGTTCCAGA ATTTTGCCCT GTTTCCCCAC
CGCACGGTCT TGGACAACGT GGCCTACGGC CTTGAAATTC AGAAGCTGGA GCCGGTGGCC
CGCAAAAAAC GGTCCCTGGA GGCCCTGGCC CAGGTGGGGC TGGAGGGGTG GGCCGACTCC
TATCCCGGTC AGCTCAGCGG CGGCATGCAG CAGCGGGTGG GCCTGGCCCG GGCCCTGGCC
CTGGACGCGG ATGTCATGCT GATGGACGAG GCGTTCAGCG CCCTGGACCC GTTGATCCGC
GGCGACATGC AGGACGAACT GCTCACCCTT CAGGACAGGA TGCAGAAGAC CATCGTGTTT
ATCAGCCATG ACCTGGACGA GGCCCTCAAG CTGGGGGACC GGATCGTGCT GATGAAGGAC
GCCCGTATCG TACAGGCCGG AACAGCCGAG GAGATTCTCT CCCACCCGGC CAATGATTAT
GTGGCCAAGT TCGTGGAAGA CGTGGACATG ACCAAGGTGA TCACCGCCGA ACGGGTGATG
ATCAAACCCA AGGAGCTGGC CTATTTTCAC ACCGACGGCC CCAAGGCGGC CCTGCGAAAG
ATGCAACACT CCCAGATTTC CCAGATTTTC GTGAGAAAAG ACAAAAAGCT CTACGGCTAT
GTTACGGCTG ACACCGCCGC CGAGGCGGCC GGGAGGGGGG ATCCCACCCT TGAAAAGATC
GTGAACACCG ATATTGAGAC AGTGGCCCTG GACACCCCGG CGGTGGAGAT CATTCCGCTG
CTGGCCCGGC TGCCCTACCC GGTGCCGGTG GTGGATGAGG CCGGCCGGTT AAAGGGCGTG
ATCATAAAAG GGTCGTTGCT GGCCGGACTT TCAGAAAGGG GGACCATTGG AAATGTATAG
 
Protein sequence
MTEKIVIENL YKIFGPNPEA AMKLLAQGMG KDEIMEKTRH GVGVADASFT VKEGEILVVM 
GLSGSGKSTL VRCINRLIDP TAGKVIVDGQ DVTKLDKEQL RRFRLQHFGM VFQNFALFPH
RTVLDNVAYG LEIQKLEPVA RKKRSLEALA QVGLEGWADS YPGQLSGGMQ QRVGLARALA
LDADVMLMDE AFSALDPLIR GDMQDELLTL QDRMQKTIVF ISHDLDEALK LGDRIVLMKD
ARIVQAGTAE EILSHPANDY VAKFVEDVDM TKVITAERVM IKPKELAYFH TDGPKAALRK
MQHSQISQIF VRKDKKLYGY VTADTAAEAA GRGDPTLEKI VNTDIETVAL DTPAVEIIPL
LARLPYPVPV VDEAGRLKGV IIKGSLLAGL SERGTIGNV