Gene Daro_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1233 
Symbol 
ID3569417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1339707 
End bp1340795 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content59% 
IMG OID637679700 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_284459 
Protein GI71906872 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTG CCGATCAAGC GCTGTCTTAC GTTCGCGCCA TTTCGCCCTA TCAGCCGGGC 
AAGCCGATCA CCGAGCTGGC CCGTGAAATG GGTATTCCGG TCGAGAAAAT CGTCAAGCTG
GCCTCCAATG AGAACCCGCT GGGCATGAGC CCGAAGGCCA GAAAGGCTGT TGAAGCAGCG
ATTAGTGGCA TCGAACGCTA CCCGGATCAG TTTGATCTGA TCGCCAAGGT GGCAGAGCGT
TGCGGCGTTT CGAGCAACCA GATTGTGCTT GGCAATGGCT CGAATGACGT GCTCGACCTG
ATCGCCCGCG TTTTTCTGGC GCCAGGCCGA TCGGCGGTGT TTGCCCAGCA CGCCTTCGCC
GTTTATCCAT TGGCCACGCT GTCGACTGGT GCCGAGCTGA TCTCGACGCC AGCCAAGAAC
TACGGCCATG ACCTGAATGC CATGCGTGCT GCCATCCGCC CGGATACGCG CATTGTCTGG
ATTGCCAATC CGAACAACCC GACCGGTAAC TTCCTGCCGT ATCCGGAAGT TCGCGCCTTT
CTGGAGGTTG TGCCCAAGGA TGTCGTGGTC GTCCTCGACG AGGCCTACAA CGAATATATT
CCGCCGGCCG AACGGGTCGA TACCGCCACC TGGATCAAGG ATTTTCCGAA CCTTGTGGTC
TGCCGTACTT TCTCCAAGAT TTTCGGGCTG GCCGGTCTGC GTGTCGGCTA TGCGCTGGCT
TCGACCGAGG TGGCTGACCT GATGAACCGT ATCCGTCAGC CGTTCAACGT CAATAACCTG
GCAATTGCCG CCGCCGTTGC CGCGCTCGAC GACCATCTGT TTGTGGCTGA CAGCTACGAA
CTCAATCGTC GGGGCATGGA ACAGATTATT GCCGGCCTGA AGCGCTTCGG GCTGGAGCAT
ATTCCGTCGC ACGGCAACTT CGTGACCTTC CGGGCAGGCG ATGCGGCGGT AGTGAACCAG
AAATTGCTGA AGCAGGGCGT TATCGTTCGT CCGATTGGCG GCTATGGCCT GCCGGAGTGG
CTACGCGTCA CCATTGGCAC GGAGCCGGAG AACGCCCGCT TCCTGGAAGC GCTGGAGAAG
GCGCTTTAA
 
Protein sequence
MSLADQALSY VRAISPYQPG KPITELAREM GIPVEKIVKL ASNENPLGMS PKARKAVEAA 
ISGIERYPDQ FDLIAKVAER CGVSSNQIVL GNGSNDVLDL IARVFLAPGR SAVFAQHAFA
VYPLATLSTG AELISTPAKN YGHDLNAMRA AIRPDTRIVW IANPNNPTGN FLPYPEVRAF
LEVVPKDVVV VLDEAYNEYI PPAERVDTAT WIKDFPNLVV CRTFSKIFGL AGLRVGYALA
STEVADLMNR IRQPFNVNNL AIAAAVAALD DHLFVADSYE LNRRGMEQII AGLKRFGLEH
IPSHGNFVTF RAGDAAVVNQ KLLKQGVIVR PIGGYGLPEW LRVTIGTEPE NARFLEALEK
AL