Gene Slin_5085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5085 
Symbol 
ID8728851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6220224 
End bp6222473 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content54% 
IMG OID 
Productcatalase/peroxidase HPI 
Protein accessionYP_003389859 
Protein GI284039929 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.060305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGATC ATATATATCC AAGCGAATCC TCAGATACTA AAAGTTACAA TGTTAATGGC 
GAGAGCAAAT GCCCGTTTAC GGGTGCGACG GCCAAGCAAA GTGCGGGTTC CGGCACGAGA
AACCGGGATT GGTGGCCTAA TCAGCTTAAG CTAAACGTTC TCCGCCAGCA CTCCCCGCTA
TCCAACCCTA TGGATAAGGC ATTTAACTAC GCTGAGGCTT TCAAATCGCT GGATCTGAAT
GCGGTAAAGA ACGACATTTT CGATCTGATG ACCACATCTC AGGACTGGTG GCCAGCCGAT
TACGGTCACT ATGGCCCTTT CTTCATCCGG ATGGCCTGGC ATAGCGCGGG TACGTATCGA
ATTGCCGATG GCCGTGGTGG AGCAGGTTCG GGAACCCAGC GCTTTGCCCC CCTGAACAGT
TGGCCCGACA ACGCAAACCT CGACAAGGCA CGCTTACTGC TATGGCCTGT CAAGAAAAAA
TATGGTAGAA AGATTTCGTG GGCCGATCTG ATGATTCTTG CTGGTAACTG CGCGCTTGAG
TCGATGGGTT TCAAAACATT CGGTTTCGCC GGTGGACGGG AGGATGTTTG GGAACCGGAA
GAAGATATTT ACTGGGGTGC TGAAACCGAA TGGCTGGGCG ACAAGCGCTA TTCTGGTGAC
CGCGAACTGG AAAATCCGCT GGCTGCCGTA CAGATGGGTC TTATTTACGT AAACCCTGAA
GGACCCAACA GTAGACCGGA CCCGCTGGCA TCTGCCCGCG ACATTCGGGA AACCTTTGGC
CGCATGGCCA TGAATGACGA AGAAACGGTT GCGCTTATTG CCGGTGGACA TACCTTTGGT
AAAACGCACG GCGCAGCTGA TCCGGGCCAG TATGTAGGGG CAGAACCTGC CGGTGCTGGT
ATTGAAGAAC AAAGCCTGGG CTGGAAAAAC ACCTTCGGAA CCGGTAACGC CGGAGACACC
ATCACCAGCG GTCTGGAAGG AGCCTGGACC ACAACGCCAA CGCAGTGGGA TAACAACTAC
TTCGACAATC TGTTCGGGTT CGACTGGGAG CTGACCAAGA GCCCGGCCGG TGCGCATCAG
TGGAGACCGA AAGATGGCGC CGGCGCCGGT ACCGTGCCCG ATGCACACGA CCCGGCCAAG
CGCCATGCAC CCATGATGTT TACGACTGAC CTCGCCCTGC GGATGGACCC TATTTATGAG
CCTATCTCAA GACGTTTTCA CGAAAATCCA GATCAATTTG CCGACGCCTT TGCCCGTGCC
TGGTTTAAGC TGACTCACCG CGATATGGGC CCGATTGCCC GCTATCTCGG TCCCGAAGTA
CCCACCGAAG AACTGATCTG GCAAGATCCA ATCCCGGCCG TTACGCATCC TTTGATTGAT
GAACAGGATA CGGCTGCATT GAAAAAAATG ATACTGGCTT CGGGCCTGTC TGTTTCTCAA
CTGGTATCTA CCGCCTGGGC CTCTGCATCG ACTTTCCGTG GGTCCGATAA ACGCGGTGGT
GCCAATGGGG GACGCCTTCG GCTGGCACCG CAGAAGGATT GGGATGTCAA CCATCCGGGT
CTGCTGGCAA CTGTACTGGA AAAGTTGGAA GGTATTCAAA TCGACTTCAA CAGTATGCAA
CAGGATGGAA AGCAGGTTTC TCTTGCGGAC CTGATCGTGC TGGGTGGCAG TGTAGGTATT
GAGCAAGCGG CTAAAAAAGC TGGTCATGAG GTGACAGTAC CGTTCACGCC CGGACGCGCC
GATGCATCGC AGGAACAGAC CGATGTTGAG TCGTTCGCCG TTTTGGAACC GGAATCAGAC
GGTTTCCGCA ACTACTCCAA GACGAAATAC ACCGTGTCGG CAGAAGAAAT GCTGATTGAT
AAAGCACAAT TACTAACGCT GAACGCCCCA GAAATGACCG TTCTGGTTGG CGGCATGCGG
GTTCTGAACA CGAACTACGG TTTTTCCAAA CACGGTGTAT TTACAAAGCG CCCGGAGGCC
CTTACCAACG ACTTTTTCGT TAACCTGCTC GATCTCGGTA CGACCTGGAA GGCAGCCTCG
CAACACCAGG ATGTGTTTGA AGGCCGTGAC CGGACAACAG GCGAATTGAA ATGGACTGGT
ACCCGAGTCG ATCTTATTTT TGGTTCTAAT TCAGAACTCC GGGCACTTGC TGAAGTGTAC
GCCTGCGAAG ATGCACAGGA GCAGTTTGTA CAGGATTTTG TAGCGGCATG GACCAAAGTG
ATGAATCTCG ATCGCTTCGA TCTGGCCTGA
 
Protein sequence
MGDHIYPSES SDTKSYNVNG ESKCPFTGAT AKQSAGSGTR NRDWWPNQLK LNVLRQHSPL 
SNPMDKAFNY AEAFKSLDLN AVKNDIFDLM TTSQDWWPAD YGHYGPFFIR MAWHSAGTYR
IADGRGGAGS GTQRFAPLNS WPDNANLDKA RLLLWPVKKK YGRKISWADL MILAGNCALE
SMGFKTFGFA GGREDVWEPE EDIYWGAETE WLGDKRYSGD RELENPLAAV QMGLIYVNPE
GPNSRPDPLA SARDIRETFG RMAMNDEETV ALIAGGHTFG KTHGAADPGQ YVGAEPAGAG
IEEQSLGWKN TFGTGNAGDT ITSGLEGAWT TTPTQWDNNY FDNLFGFDWE LTKSPAGAHQ
WRPKDGAGAG TVPDAHDPAK RHAPMMFTTD LALRMDPIYE PISRRFHENP DQFADAFARA
WFKLTHRDMG PIARYLGPEV PTEELIWQDP IPAVTHPLID EQDTAALKKM ILASGLSVSQ
LVSTAWASAS TFRGSDKRGG ANGGRLRLAP QKDWDVNHPG LLATVLEKLE GIQIDFNSMQ
QDGKQVSLAD LIVLGGSVGI EQAAKKAGHE VTVPFTPGRA DASQEQTDVE SFAVLEPESD
GFRNYSKTKY TVSAEEMLID KAQLLTLNAP EMTVLVGGMR VLNTNYGFSK HGVFTKRPEA
LTNDFFVNLL DLGTTWKAAS QHQDVFEGRD RTTGELKWTG TRVDLIFGSN SELRALAEVY
ACEDAQEQFV QDFVAAWTKV MNLDRFDLA