Gene Slin_0284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0284 
Symbol 
ID8724012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp375003 
End bp376613 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content51% 
IMG OID 
Productmetal dependent phosphohydrolase 
Protein accessionYP_003385147 
Protein GI284035217 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.876396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTC CAATTTTGTT AACGATTCTT GCCGCCCTTG CAGGGGGTGG TATTGGCATA 
TTAATTGGTC GCCAAACCAT GGCGGGCGTT CGCGCGAAGC ATGAGAAAGA TGCAGAAGAA
AAAGCAGCAG CTATTTTAAA AAATGCTGAA TTACAGGCCG AAACGATAAA AAAAGATCGG
ATACTGGAAG CAAAAGAAAA ATACCTGAAG CTGAAAACGG AGTTCGAAGA AACAACAAAT
CAGAAGCGGA ACCTGCTTCT GCAGAACGAA ACCAAGCTCA AACAGCGCGA ACAACAACTT
GCCCAACAGG CCGATCAGCA GCGCACCCGT GAAGGTGAGC TGAACCAGCA GCGTAATGAA
CTCGGGCAGC AAAAAAACAC GCTAGCGCAA CAGATTGAGG CTCTTAATAA ACGCCGGGAA
GACGTTGACC GTCGGCAGCA GGAAGCCGAC CGTATGCTCG CCGATCAGGT GGCTCAGCTC
GAAAAAATTG CAGGTCTGTC TGCCGAGCAG GCACGTGAGC AACTCATAGA AACGCTGAAA
GCCGAGGCCG AAACACGGGC TTCCTCCTAC ATCAAAAATA TTATTGAAGA AGCTAAGCTG
ACCGCTACTA AAGAAGCGAA AAAGGTGGTT ATTGAAACCA TTCAGCGAAC GGCTACCGAG
CACGCCATTG AAAACTGTGT GTCCGTTTTC AACATTGAAT CGGATGATGT AAAGGGCAAA
GTTATTGGCC GGGAAGGTCG TAACATTCGT GCCCTCGAAG CAGCAACCGG CGTTGAAATT
ATCGTCGATG ATACCCCCGA AGCCATTATC ATTTCGGGCT TCGATCCCGT TCGGCGCGAG
ATTGCCCGGC TCTCCCTGCA CCGGCTCGTA CAGGACGGTC GTATCCACCC CGCCCGGATT
GAAGAGATCG TTGCCAAAAC CCGCAAAAAT ATTGAAGACG AAATTGTTGA GATCGGCGAA
CGGACTGTCA TCGACCTCGG CATTCACGGT CTTCACCCCG AGCTGATCAA GATGGTTGGC
CGAATGCGCT TCCGGTCAAG TTACGGGCAA AACCTGCTCC AGCACTCCCG CGAAGTAGCC
AAACTGTGCG CCACTATGGC GGCTGAACTG GGCCTGAATG CCAAGCTCGC CAAGCGGGCT
GGATTGCTTC ACGATATTGG CAAGGTGTGG CCCGAAGAAG CTGAACTACC CCACGCCATA
TTGGGCATGG AGCTTGCCAA GAAATACAAG GAGAATCCGG AAGTTATCAA TGCTATCGGC
GCTCACCACG ACGAGATCGA GATGACGAGT ATGATTTCGC CAATTGTGCA GGTTTGTGAC
GCCGTATCGG GCTCACGGCC GGGTGCCCGT CGCGAGATGA TGGAGTCGTA CATTAAACGA
CTTAAAGAAC TGGAAGAACT GGCCGGAAAT TTTCCGGGCG TAACCAAGTG CTATGCTATT
CAGGCCGGTC GCGAGTTACG GATTATGGTC GATGCTGATC ATGTTTCCGA TGAGCGTGCG
GGTATTCTGT CGTATGAAAT TTCACAAAAA ATAGAGAAGG AGATGCAGTA TCCCGGTCAG
ATCAAAGTAA CGGTCATCCG GGAAATGCGG GCAGTAGCCT ACGCCAAGTA G
 
Protein sequence
MDIPILLTIL AALAGGGIGI LIGRQTMAGV RAKHEKDAEE KAAAILKNAE LQAETIKKDR 
ILEAKEKYLK LKTEFEETTN QKRNLLLQNE TKLKQREQQL AQQADQQRTR EGELNQQRNE
LGQQKNTLAQ QIEALNKRRE DVDRRQQEAD RMLADQVAQL EKIAGLSAEQ AREQLIETLK
AEAETRASSY IKNIIEEAKL TATKEAKKVV IETIQRTATE HAIENCVSVF NIESDDVKGK
VIGREGRNIR ALEAATGVEI IVDDTPEAII ISGFDPVRRE IARLSLHRLV QDGRIHPARI
EEIVAKTRKN IEDEIVEIGE RTVIDLGIHG LHPELIKMVG RMRFRSSYGQ NLLQHSREVA
KLCATMAAEL GLNAKLAKRA GLLHDIGKVW PEEAELPHAI LGMELAKKYK ENPEVINAIG
AHHDEIEMTS MISPIVQVCD AVSGSRPGAR REMMESYIKR LKELEELAGN FPGVTKCYAI
QAGRELRIMV DADHVSDERA GILSYEISQK IEKEMQYPGQ IKVTVIREMR AVAYAK