Gene Dshi_2088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2088 
Symbol 
ID5713083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2212787 
End bp2213905 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID641268010 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001533426 
Protein GI159044632 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATA CGCCCGGCCC CCGCTTCACC CCTCTCGCCC AGTCCCTGCC CGCCACGGTC 
CCCTTCGTCG GCCCCGAAAC CCAGGAACGC GCCCGCGGCC GCCCCTTCGC CGCCCGGCTC
GGCGCAAACG AGAGCGCCTT TGGCCCCTCG CCCCGGGCCG TGGCCGCCAT GGCCGAGGCC
GCGACCGGCG CGTGGATGTA TGGCGACCCG GAAAGTCACG ATCTGCGCGC GGCGCTGGCC
GCCCATCACC GGGTCGGGAT GGAGAACGTG ATCGTCGGCG AAGGCATCGA CGGGCTGCTG
GGCTATCTCG TGCGGCTGCT GGTGGCGCCG GGCGATACGG TCGTGACCTC CGCCGGGGCT
TATCCGACCT TCAACTACCA TGTGGCGGGC TTCGGCGGCA CGCTCCATGC GGTGCCTTAC
CGCGACGACC ACGAGGACCC GCAGGCGCTC CTGGACATGG CCCGGGCGGT GGACGCCAAG
GCGATCTATC TCGCCAACCC CGACAACCCC ATGGGCAGCT GGCACGCCGC CGGTGTGATC
ACCGACATGA TTGACGCACT GCCGCCCGGC TGTCTTCTGC TGCTGGACGA AGCCTATATC
GAGCTTGCCC CCGACGGCAC CGCCCCCGAG ATCGCTCCGG ACGACCCCCG CGTCATCCGC
CTGCGCACCT TCTCCAAGGC CCGGGGACTG GCCGGCGCGC GGGTCGGCTA CGGCATCGCC
GCGCCCGGGC TGATTTCCGC CTTCGGCAAG GTGCGCAACC ATTTCGGCAT GAGCCGCGTC
TCGCAAGCCG CGGCCCTCGC CGCGCTACAG GACAGCGACC ACCTCGCGAA GGTGGTGGCC
AAGACCGCCG CCGCCCGCAC CCGGATCGCC GAGATCGGCG CGGCCCATGG CCTGCGCGCC
CTACCCTCGG CCACCAATTT CGTCACGCTG GATTGCGGCG GTGACGGCGC GCGGGCCAAG
GCCATCCTGG AGGCCCTGAT CGCCCGGGAC ATCTTCGTCC GCATGCCCTT CGTCGCCCCC
CAGGACCGCT GCATCCGCAT CTCCTGCGGC ACGCCGGAGA TGCTCGATCT ACTGGCAGAG
CGCCTGCCGG ATGCGCTCGC GGCCGCCACA AAGCCCTGA
 
Protein sequence
MTHTPGPRFT PLAQSLPATV PFVGPETQER ARGRPFAARL GANESAFGPS PRAVAAMAEA 
ATGAWMYGDP ESHDLRAALA AHHRVGMENV IVGEGIDGLL GYLVRLLVAP GDTVVTSAGA
YPTFNYHVAG FGGTLHAVPY RDDHEDPQAL LDMARAVDAK AIYLANPDNP MGSWHAAGVI
TDMIDALPPG CLLLLDEAYI ELAPDGTAPE IAPDDPRVIR LRTFSKARGL AGARVGYGIA
APGLISAFGK VRNHFGMSRV SQAAALAALQ DSDHLAKVVA KTAAARTRIA EIGAAHGLRA
LPSATNFVTL DCGGDGARAK AILEALIARD IFVRMPFVAP QDRCIRISCG TPEMLDLLAE
RLPDALAAAT KP