Gene Dole_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1231 
Symbol 
ID5694065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1469564 
End bp1470583 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content60% 
IMG OID641263824 
Productaminotransferase class I and II 
Protein accessionYP_001529114 
Protein GI158521244 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCA ACGTCAATCC CCTGGGGCCC CCGGCCGGGC TGATCGCGTA TCTTTGCGAT 
CGGATGACCG ACATTGTCTG TCTGCCGGAT CCTGATGCCT TGCACATCCG CCGGTCCTTT
GCAAAGCGCC ATGGCCTGGC CCCGGGACAC GTGGTGGCGG GCAACGGCAC CACCCAGTTG
ATTTACGCCC TGCCGCCGGC CCTGGGGTTG GGGCGGGTCC TGGTGCTGGG GCCGGCCTAT
GCCGATTACG CCCAGGCCTG CGCCATGCAC AACGTGCCCT GTGATTTTCT GCTTGCCGAT
GAGGCAGGCG GCTTTCGGCA TGATGCCGGC GTAATTGCCC GAAAGATACG GGAGGCAAAG
CCGGACGCGG TGTTTGTCTG CAACCCTAGC AACCCCACGG GCGTGTTGAT GGACCGGCAG
GTGATTCTGG ATCTGTGCAA TGAGAGCCCT GATGTGCTGT TTGTGATTGA TGAGTCCTAC
CTGCCGTTTG TCCGAGAAGG GGAGTCCCTC AGCCTGATCA ACGGGCCCGG CATGGACAAT
TTGCTGGTTT TAAGCTCCAT GTCGAAAATA TTCCGCGTAC CCGGCCTGCG TATCGGGTTT
GCCGCGGGCC CTGAGCCGGT TGTAGATCTG CTGGCCCGGC ATCTGCCCTG CTGGAGCGTC
AATACCCTTG CCCAGGCCGC GGTTGACTGG ATTCTGGAGC ACAAACGCGA AGTGAACCGG
TTTATAGACG ATGCCGTGAC CCTGGTGGAA GAGGAACGGT CGTTTCTGCT TCAGCGACTT
GCCGCATCCG GAGTGGTGAG CCTTTTCCCC TCGGTGGCAT CTTTTATGCT GGGCGTCCTG
CATTCGGGCT TTACCTCAGC ATCTGTCTGT GACGCCCTTG CGCAGGGGCG CATTCTGATC
CGGGACTGCG CCAATTTTGA AGGGCTTTCC GACCGGCACA TTCGTATTTC CCTTAAAACA
CGGGAGCACA ACAGCCTGCT GGTCGACCGT TTATTTAACC TGTGTCCATC CTCGTTATGA
 
Protein sequence
MSSNVNPLGP PAGLIAYLCD RMTDIVCLPD PDALHIRRSF AKRHGLAPGH VVAGNGTTQL 
IYALPPALGL GRVLVLGPAY ADYAQACAMH NVPCDFLLAD EAGGFRHDAG VIARKIREAK
PDAVFVCNPS NPTGVLMDRQ VILDLCNESP DVLFVIDESY LPFVREGESL SLINGPGMDN
LLVLSSMSKI FRVPGLRIGF AAGPEPVVDL LARHLPCWSV NTLAQAAVDW ILEHKREVNR
FIDDAVTLVE EERSFLLQRL AASGVVSLFP SVASFMLGVL HSGFTSASVC DALAQGRILI
RDCANFEGLS DRHIRISLKT REHNSLLVDR LFNLCPSSL