Gene Dole_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0833 
Symbol 
ID5693668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp966496 
End bp967647 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content57% 
IMG OID641263430 
Productextracellular ligand-binding receptor 
Protein accessionYP_001528720 
Protein GI158520850 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000252201 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA GTATCCTGGC AACAGCGGCA GTGCTGGTTG TCATGCTGAT GTGCGGCGGC 
ATGCTTTACG CCGCTTCGGA CGATTACCGG GTGGGGTGTA TTTTTTCCGT TACCGGCAAG
GCCTCATGGC TGGGTGAACC GGAAAAAAAG ACCGCGGAAA TGCTGGCTGA AAAAATCAAC
GCCGCCGGCG GTATCAACGG TCACAAGCTG AAATTATATA TCGAGGATGA CCAGGGCGAC
AACACCCGGG CGGTCAACGC GGCAAAGAAA CTGATCAACA GGGACAAGGT ATGCGCCATT
ATCGGGCCGT CGGTTTCCGG CGCCACCATG GCGATTCTTC CGGTCATGCA GGAAGCCGAG
ATCCCGCTGG TCTCCTGCGC GGCCGCCGCG GTTATTGTCG AGCCAGTGGC CGAGAGGAAA
TGGATTTTCA AGACCCCCCA GAAAGACAGT GACGCCGTAC GGCGCATTTA CGAGCACATG
ATCTCCAGGG GCATCAAGGA TGTGGGGCTG ATTACCGGAA CCACCGGTTT CGGTAATGCC
GGACGCACCC AGCTCAAGGA CCTGGCCCCA GAATACAAGA TGAACATTGT GGCCGATGAA
ACCTACGGCC CGGCCGACAC GGACATGACC GCCCAGCTGG TCAACATCCG CAACGCCAAG
GCCCAGGCCG TTATCAACTG GTCCATCGTA CCGGCCCAGT CCATTGTTCC CAAGAACATG
AAACAGCTGA ACATGACGAT TCCGTTGTAC CAGAGCCACG GTTTTGGCAA CATTAAATAC
GTGGAAGCCG CCGGGGAAGC CGCTGAAGGG ATTATCTTTC CCGCCGGCCG GCTGCTGGCC
GTGGACACCC TTTCCGGCGA CAATCCCCAG AAAGCCCTGC TGGCCGCTTA CAAGGCCGAA
TACGAGGCCA GGTACAATGA GCCGGTGAGC ACCTTTGGCG GACATGCCTA TGACGCGCTC
AGCATCGTCG TAAAGGCGCT GGAAAAAGCC GGCGACGATC CGGCCAAAAT TCGTGACACC
ATTGAGACCA TTGAATTCGT GGGTACCGGC GGGGTCTTCA AGTTTTCGGC CGAAGATCAC
ACCGGTCTGG ACAAGAACGC TTTTGAAATG CTGACCGTCA AGGACGGGAA ATTCGTCGTC
CTGACAGACT AG
 
Protein sequence
MKKSILATAA VLVVMLMCGG MLYAASDDYR VGCIFSVTGK ASWLGEPEKK TAEMLAEKIN 
AAGGINGHKL KLYIEDDQGD NTRAVNAAKK LINRDKVCAI IGPSVSGATM AILPVMQEAE
IPLVSCAAAA VIVEPVAERK WIFKTPQKDS DAVRRIYEHM ISRGIKDVGL ITGTTGFGNA
GRTQLKDLAP EYKMNIVADE TYGPADTDMT AQLVNIRNAK AQAVINWSIV PAQSIVPKNM
KQLNMTIPLY QSHGFGNIKY VEAAGEAAEG IIFPAGRLLA VDTLSGDNPQ KALLAAYKAE
YEARYNEPVS TFGGHAYDAL SIVVKALEKA GDDPAKIRDT IETIEFVGTG GVFKFSAEDH
TGLDKNAFEM LTVKDGKFVV LTD