Gene Smal_1757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_1757 
SymbolhisD 
ID6475628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp1968370 
End bp1969665 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content69% 
IMG OID642730939 
Producthistidinol dehydrogenase 
Protein accessionYP_002028144 
Protein GI194365534 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.338237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0340923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGTC TGATATGGTC CCGACTTGAT GAGGCCGCGC GCAGCGCGGC ACTGACCCGC 
CCGGTGCAGA CCGTCGCGCA GCAGACCCGT GACGCCGTGG CTGCGCTGAT TGCACAGGTG
CGTGCACAGG GCGACGATGC GCTACGTGCA ATCACCGCGC GCTTCGATGG TGTCGAGCTG
GCCTCGTTCG AAGTTTCCGA AGCCGAATTC GCCGCTGCCG ACGCAGCGGT GCCGGCCGAA
TTGCGGCAGG CGATGGTCGA AGCCGCAGAA CGCATCGCGC GCTTCCACGC CGCCGGTATG
GGCAAAGGCT ATGCGGTGGA AACCGCACCG GGCGTGGTCT GCGAACGCAT GCTGCGCCCG
ATCGGCCGGG TGGGTCTGTA CGTACCGGCC GGCAGTGCGC CGCTGCCGTC CACCGCGTTG
ATGCTGGGCG TACCGGCACA GCTGGCCGGC TGCCCGCAGG TGGTGCTGTG CACGCCGCCG
CGCGCCGACG GCAGTGCCGA TCCTGCAGTG CTGGTGGCCG CGCGCCTGAC CGGAGTGCAG
CGCGTGTTCA AGCTGGGCGG CGCGCAGGCA ATCGCGGCGA TGGCCTACGG CACCGCCAGC
ATTCCGGCCT GTGACAAGCT GTTCGGACCG GGCAACAGCT TCGTTACCGA GGCCAAGCAG
CAGATTGCAC AGGACGGCGC GGCGGCCATC GACATGCCGG CCGGTCCGTC GGAAGTGCTG
GTCATTGCCG ACGCCGGTGC CAATCCCGCT TTCGTGGCGG CCGACTTGCT GTCGCAGGCC
GAGCACGGCC CGGATTCGCA GGTCCTGCTG TTGACCGACG ACGCGGCGAT GCTGGCCGCA
GTGGAAGCCG AAGTGGAGCG CCAGGTCGCC TTGCTGCCGC GCCAGCAGAT CGCACGTCAG
GCGCTGTCTG CGTCGCGCCT GATCCAGGTC GATGCGCTGC CTGAAGCCTT CGCCATCAGC
AACCGTTATG CACCCGAGCA CCTGATCCTG GCCCTGCGTG AACCGCGCGA CTGGCTCGGC
CAGGTGCAGG CCGCCGGCTC AGTCTTCCTT GGCGATTACA CCCCCGAAGC GCTGGGCGAC
TACTGCAGCG GCACCAACCA CGTGCTGCCG ACGGCCGGTG CCGCGCGCGC CTACAGCGGC
GTCAGCGTGG CCAGCTTCCA GAACTTGATC AGCGTGCAGA GCGCCAGCGC TGCCGGCCTG
GCGGCGATTG GCGGCTGCGC GCGCATCATC GCCAGCGCCG AAGGCCTGGA TGCGCATGAG
CGTGCGGTTG CGCTTCGCAT GGAGGCGGCG GCATGA
 
Protein sequence
MNRLIWSRLD EAARSAALTR PVQTVAQQTR DAVAALIAQV RAQGDDALRA ITARFDGVEL 
ASFEVSEAEF AAADAAVPAE LRQAMVEAAE RIARFHAAGM GKGYAVETAP GVVCERMLRP
IGRVGLYVPA GSAPLPSTAL MLGVPAQLAG CPQVVLCTPP RADGSADPAV LVAARLTGVQ
RVFKLGGAQA IAAMAYGTAS IPACDKLFGP GNSFVTEAKQ QIAQDGAAAI DMPAGPSEVL
VIADAGANPA FVAADLLSQA EHGPDSQVLL LTDDAAMLAA VEAEVERQVA LLPRQQIARQ
ALSASRLIQV DALPEAFAIS NRYAPEHLIL ALREPRDWLG QVQAAGSVFL GDYTPEALGD
YCSGTNHVLP TAGAARAYSG VSVASFQNLI SVQSASAAGL AAIGGCARII ASAEGLDAHE
RAVALRMEAA A