Gene Dole_3009 details


Gene Information

Locus tag: Dole_3009
Symbol:
ID: 5695868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3611015
End bp: 3612262
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 59%
IMG OID: 641265625
Product: extracellular ligand-binding receptor
Protein accession: YP_001530889
Protein GI: 158523019
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
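
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3611015, 3612262)
print(length_bp)                  # 1248
print(protein_length(length_bp))  # 415
```

Both values agree with the Gene Length and Protein Length fields of this record.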


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000545529
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAT TCGCAGCGAT AATCATCATC AGCGCCGCAC TGGCCCTGTG CTTCGGCACG 
GCCGGCATGT GCGAGGTCGG CGTGACCGAT ACCGAGATTC ACATCGGCCA GTGGGGTCCC
CAGACCGGCC CGGCCGCGCC CTGGGGCGCC GTGGCCCGGG GTACCGACGC CTACTTTAAA
ATGATCAATG CCGAAGGCGG CATTCACGGC CGCAAGCTGG TCCATCACTA TTTTGACGAT
GCCTACAACC CGGCCAAGAC CGTGGCCGGC GTCAAGCAGC TTCAGGAACA GGAGGGCATG
TTCGCCTGGG TCAGCGGCGT GGGCACGGCC ACGGGCCTGG CGGTCAAGGA CTACCTGATG
GAAAACAAGA TTCCCTGGAT CGGTCCGTCT GCCGGTTCCC GCCACTGGGT GGAGCCGCCC
CAGAAATACC TGTTCAACGT TTATCCCTTC TACATGGGCG ATGCCCAGCT CCTGTGCCAG
TATGCCGTTG AAACCATGGG CAAGAAGAAA ATTGCCATTG CCTTTCAGAA TGACGACTAC
GGCAAGCAGG GCGTGGAAGG CGCTGAGTAC CAGCTTAAAA AGGCGGGCCT GGAGCTGGCC
GTCAAGGTGC CGGTCAACGT GGCTGATACC GACATGATTC CCCATGTCAT GGAGCTGAAA
AAAGCCGGGG CCGACGCGGT GCTGCTGTTT GTCACCCCTG GTCATGTGGC CCGCATCATC
GGCACGGGCA AGGCCATGCA GTTTGAGCCC ACCTGGATGT CCACCTCCAC CTGCGGTGAC
TTTCCCCTGA TGATGGCCAT CACCAAGGGC CTTTGTAAGG GCCTGATCAC GGCATCTTTC
GGTCTTGCCG AACCCACGGG CCATGTGGGG GAAGTCCAGC TTCTTGATAA TCCGGTTCAG
AAAATGGTCG CCAAGTACAA GACCGATGCC TTTGACAAGT TCGCGGCCAA GGATGAACGG
TACGGCTACA CCTTTCTCGC GGGTATCGGC TTTGCCGAGC CCCTGGTGGA GGCCATCCGC
CGCTGTGGAA AGGACCTGAC CCGGGAGAAA CTGGTCAAGG AACTGGAAAA CATGAAGAAC
TTCAAGGGCG TCCTGGGCCG TATCAACTAC AAGCCCTTTG ACCCCAAGGA CCCCCTCTGT
CGCCTGGGCC AGGGAGAGGT CTTTCTCCAG GAGTGCACGG AAAACGGCGG ATCCAAGATC
CTGACCGACT GGGTAACAAC CACCTACCTG CCGTCAAAGG CGGAATAA
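
The GC content field of this record can be recomputed from the gene sequence above. The sketch below is one way to do so, assuming the figure is the percentage of G and C bases over the whole gene sequence; the record does not state how it was derived or rounded. The helper `gc_content` is a hypothetical name, and it removes the spaces and line breaks used in the 10-base display blocks before counting.

```python
def gc_content(sequence):
    """Percentage of G and C bases, ignoring whitespace in the input."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustration; paste the full gene sequence to compare with the record.
print(gc_content("GGCC AATT"))
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the percentage listed under Gene Information.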
 
Protein sequence
MKRFAAIIII SAALALCFGT AGMCEVGVTD TEIHIGQWGP QTGPAAPWGA VARGTDAYFK 
MINAEGGIHG RKLVHHYFDD AYNPAKTVAG VKQLQEQEGM FAWVSGVGTA TGLAVKDYLM
ENKIPWIGPS AGSRHWVEPP QKYLFNVYPF YMGDAQLLCQ YAVETMGKKK IAIAFQNDDY
GKQGVEGAEY QLKKAGLELA VKVPVNVADT DMIPHVMELK KAGADAVLLF VTPGHVARII
GTGKAMQFEP TWMSTSTCGD FPLMMAITKG LCKGLITASF GLAEPTGHVG EVQLLDNPVQ
KMVAKYKTDA FDKFAAKDER YGYTFLAGIG FAEPLVEAIR RCGKDLTREK LVKELENMKN
FKGVLGRINY KPFDPKDPLC RLGQGEVFLQ ECTENGGSKI LTDWVTTTYL PSKAE
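
The protein sequence above is the translation of the gene sequence under translation table 11. The following sketch shows how that correspondence can be checked. It builds the codon table from the standard compact amino acid string in TCAG order; table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model. The function name `translate` is illustrative.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence from this record.
first_line = "ATGAAACGAT TCGCAGCGAT AATCATCATC AGCGCCGCAC TGGCCCTGTG CTTCGGCACG"
print(translate(first_line))  # MKRFAAIIIISAALALCFGT
```

The output matches the first 20 residues of the protein sequence. Running `translate` on the complete gene sequence allows the same comparison over the full protein.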