Gene Daro_3383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3383 
Symbol 
ID3567113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3637400 
End bp3638470 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID637681855 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_286582 
Protein GI71908995 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value0.495635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGCT TCTGGAGCCA GGTCGTCCGT GACCTCACCC CCTACGTACC GGGCGAGCAG 
CCCAAGATCG CCAACCTGAT CAAGCTCAAC ACCAACGAGA ACCCGTTCCC GCCCTCACCC
AGGGTGGTGG CGGCGATTCA GGCAGAACTT GGTGACGATG CGGCGCGCCT GCGTCTCTAC
CCCGACCCGA ATGCCGACTT GCTCAAGGCC GCGGTAGCCA GGAGGCACAC CGTTTCGGCG
CAACAGGTCT TCGTCGGCAA CGGCTCGGAC GAAGTGCTGG CCCATATCTT CATGGCGCTG
CTCAAGCACG ACCAGCCCAT TATCTTCCCC GATATCACCT ACAGCTTCTA CCCGGTCTAT
TGCGGGTTGT ACGGCGTCGA ATATCAGACG CTGCCGCTGG CTGATGATTT CTCGATCAAC
CCGGCTGACT ACTGTGACCG TCCGAATGGC GGCATCATCT TCCCCAACCC GAATGCACCG
ACCGGCCGTC TGCTGCCGCT CGATGCCATT GAGCAGATGC TCAAGGCCAA TCCGGACTCC
GTCGTCGTCG TCGATGAAGC CTATGTCGAT TTCGGTGGCG AAACGGCTAT TTCACTGGTC
GACCGCTACG ATAACTTGCT GGTCGTCCAC ACCCTGTCAA AGTCACGCTC GCTGGCCGGC
ATGCGCGTCG GCTTCGCCGT CGGCCATGCC GCACTGATCG AAGCACTGGA ACGCGTCAAG
AACAGCTTCA ACTCCTACCC ACTGGATCGT CTGGCCATCG TCGCTGCCGT CGCAGCGATG
GAAGATGAGG CTTATTTTGC GCAATGCTGT CATGCGGTGA TGGCCACCCG CAACACGCTG
ACCGCTGAAC TCACTGAACT CGGCTTCGAA GTCCTGCCCT CCACCGCCAA CTTCATCTTC
ACCCGTCACC CCCAACGCGA CGCAGCTGAG CTGGCCAAGG CGCTACGCGA GCGCAACATC
ATCGTGCGCC ACTTCAAACT GCCACGCATC GACCAGTTCC TGCGCATTAC CGTCGGTACC
GACGGCGAAT GCAAGGCGCT GACTGACGCC CTGCGCCAGA TCACCGGCTG A
 
Protein sequence
MSRFWSQVVR DLTPYVPGEQ PKIANLIKLN TNENPFPPSP RVVAAIQAEL GDDAARLRLY 
PDPNADLLKA AVARRHTVSA QQVFVGNGSD EVLAHIFMAL LKHDQPIIFP DITYSFYPVY
CGLYGVEYQT LPLADDFSIN PADYCDRPNG GIIFPNPNAP TGRLLPLDAI EQMLKANPDS
VVVVDEAYVD FGGETAISLV DRYDNLLVVH TLSKSRSLAG MRVGFAVGHA ALIEALERVK
NSFNSYPLDR LAIVAAVAAM EDEAYFAQCC HAVMATRNTL TAELTELGFE VLPSTANFIF
TRHPQRDAAE LAKALRERNI IVRHFKLPRI DQFLRITVGT DGECKALTDA LRQITG