Gene Slin_3446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3446 
Symbol 
ID8727199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4178501 
End bp4179649 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content51% 
IMG OID 
Producthistidinol-phosphatase 
Protein accessionYP_003388253 
Protein GI284038323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAA TAGTATTTAT TGACCGCGAC GGGACGCTTA TTGCCGAGCC ACAACCCGAT 
CAACAGGTTG ACTCACTCGC CAAACTGGAT TTCATTCCAA AAGCTATTTC GGCCATGCGG
AAAATTGCCG AAGATACTAC GTATGAACTC GTTATGGTCA CTAATCAGGA TGGACTGGGT
ACCGGCTCCT TCCCCGAAGA TACGTTCTGG CCAGCTCATA ACAAAATGAT GTCCACATTT
GCCGGCGAAA ACGTCAACTT TGCGGCTGTG CATATCGACC GTCATTTCCC GCACGATAAT
TCGTCTACCC GGAAACCCGG CGTTGGTATG TTAACGCAGT ATTTCGAGGC TTCGTATGAC
CTGACCAACA GTTTCGTTAT TGGTGACCGG CTAACCGATG TTCAACTGGC TGTAAATCTG
GGTGCTAAAG CTATCCTGTT CATGCCCCCC AACGGATTAG CAGCCGTACA ATCCGCTGAT
GTCAGTGGGT TGACCGAAGC CATGAAACAG GCCATTGTAC TCCAGACCGG CGACTGGGAC
GAGATCTACG AATTTTTGCG CCTGCCCGCC CGCACGGCCC TTGTTGAGCG GAATACAAAA
GAGACGCAAA TCCGCGTGGA GTTAAACCTC GATGGCCGGG GCCGGGCCGA TATGCATACC
GGGCTTGGCT TTTTCGACCA CATGCTCGAT CAGGTAGCCA AACATTCGGG TGCCGACCTG
GCGATCCATG TCAACGGAGA TTTGCACATT GATGAACATC ACACGATAGA AGACACGGCC
CTGGCGCTCG GTGAAGCCTA TCGACGTGCC TTAGGCGATA AACGTGGCAT CAGCCGTTAT
GGGTTCCTGC TGCCAATGGA TGAAGCCCTG GCGCAGGTGG GCATTGATTT TTCGGGCCGT
CCGTGGCTGG TTTGGGATGC CGAGTTCAAG CGGGAGAAGA TCGGCGACAT GCCAACCGAG
ATGTTTTATC ATTTCTTTAA ATCGTTTTCC GATACAGCAC TTTGCAACCT AAACATTAAA
GTGGAAGGCG ATAATGAACA CCATAAAATC GAAGCCATTT TCAAGGCGTT CGCCAAGGCG
ATAAAAATGG CCGTTCGACG CGACATCAAT GAATTAGATA ACCTTCCCAG CACGAAGGGC
GTTTTATAA
 
Protein sequence
MQKIVFIDRD GTLIAEPQPD QQVDSLAKLD FIPKAISAMR KIAEDTTYEL VMVTNQDGLG 
TGSFPEDTFW PAHNKMMSTF AGENVNFAAV HIDRHFPHDN SSTRKPGVGM LTQYFEASYD
LTNSFVIGDR LTDVQLAVNL GAKAILFMPP NGLAAVQSAD VSGLTEAMKQ AIVLQTGDWD
EIYEFLRLPA RTALVERNTK ETQIRVELNL DGRGRADMHT GLGFFDHMLD QVAKHSGADL
AIHVNGDLHI DEHHTIEDTA LALGEAYRRA LGDKRGISRY GFLLPMDEAL AQVGIDFSGR
PWLVWDAEFK REKIGDMPTE MFYHFFKSFS DTALCNLNIK VEGDNEHHKI EAIFKAFAKA
IKMAVRRDIN ELDNLPSTKG VL