Gene Slin_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1784 
Symbol 
ID8725521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2156786 
End bp2158075 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content56% 
IMG OID 
Producthistidinol dehydrogenase 
Protein accessionYP_003386628 
Protein GI284036698 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.112475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCA TCCCTTTTCC CGACCGTGCT GAATGGCCTG CGCTGCTGGC TCGCCCGGTG 
CAGTCGACAC AACAGATTGA AGCGGTAGTA GCGCCTATTC TTGCTCAGGT TCGTGCCGGG
GGCGATGCCG CTTTGGTTGA ACTGGCTCAG AAGTTTGATA AAGTCGACCT GTCGGCGGAG
GGGATGGAGG TGCCTGCTTC GGAACTCGAT GCCGCCGAAC GTCAACTGAG TGATACGCTA
AAGGAGGCCA TCCGGCAGGC CTACCAGAAC ATCCGGCTTT TTCACGAGCG GCAAAAGCAG
CCCATTGAAA AAGTTGAAAC CATGCCGGGG GTCGTATGCT GGCGTAAAAG CGTTGGCATC
GAAAAAGTAG GGCTGTACAT ACCGGGCGGC ACTGCGCCCC TGTTCAGCAC GGTGCTCATG
CTGGGCATTC CGGCCCAGTT GGCAGGTTGC CGCGAGGTAG TTCTTTGTAC GCCCAGCAAC
CACCCGGCCA TTTACTTTGC AGCCAAACTC GTTGGTATCA CGAAGGTATT TCGGATTGGG
GGTGCTCAGG CCATTGCAGC CATGGCCTAC GGAACGGAAT CGGTTCCGCA GGTCTACAAG
ATTTTTGGTC CCGGTAACCA ATACGTAACG GCGGCCAAGA TGCTGGTTGC CAAAGAAGGT
ATGGCCATCG ACATGCCCGC CGGACCCAGC GAAGTGGCCG TGTATGCCGA TGATTCAGCG
GTGCCGTCGT TTGTTGCCGC CGACTTGCTC TCGCAGGCCG AACACGGTGC TGACAGCCAG
GTACTGCTGG TATCGACCAG TAAAAAACTA GTGTCGATGG TCAATCTGGT ACTGCCCACG
CAATTGAGCA AGCTCCCCCG TCGCGAACTG GCTGCCAAAG CCCTTGAGAA CAGCAAGGCA
ATTCTGGTCG ATACGGAAGC GGATGCCATT GACCTGTTAA ACGCTTATGC AGCCGAACAC
CTGATTCTGA GCGTCGAAAA TGCCGAAGCC GTCGCGGAGA AAATTATTAA TGCAGGCTCT
ATTTTTCTGG GCAACTACAC GCCTGAATCG GCCGGCGATT ACGCGTCGGG TACCAACCAT
ACCTTGCCAA CAAACGGATT TGCAAGGGCC TACAGTGGGG TATCGCTGGA TAGTTTCGTG
AAGAAGATCA CCATTCAGCA CATCACGCCG GAAGGTATCC AAAACCTCGG GCCGGTTGTG
GAAGCCATGG CCGAAGCTGA ATCGCTCGAC GCCCACAAAC GCGCCGTCAG CCTGCGAATG
GCAAGCCTGA GCGAAGTTAA TCCGGTCTGA
 
Protein sequence
MNIIPFPDRA EWPALLARPV QSTQQIEAVV APILAQVRAG GDAALVELAQ KFDKVDLSAE 
GMEVPASELD AAERQLSDTL KEAIRQAYQN IRLFHERQKQ PIEKVETMPG VVCWRKSVGI
EKVGLYIPGG TAPLFSTVLM LGIPAQLAGC REVVLCTPSN HPAIYFAAKL VGITKVFRIG
GAQAIAAMAY GTESVPQVYK IFGPGNQYVT AAKMLVAKEG MAIDMPAGPS EVAVYADDSA
VPSFVAADLL SQAEHGADSQ VLLVSTSKKL VSMVNLVLPT QLSKLPRREL AAKALENSKA
ILVDTEADAI DLLNAYAAEH LILSVENAEA VAEKIINAGS IFLGNYTPES AGDYASGTNH
TLPTNGFARA YSGVSLDSFV KKITIQHITP EGIQNLGPVV EAMAEAESLD AHKRAVSLRM
ASLSEVNPV