Gene Slin_5542 details

Gene Information

Locus tag: Slin_5542
Symbol:
ID: 8729315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6751317
End bp: 6753149
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: DNA mismatch repair protein MutS domain protein
Protein accession: YP_003390307
Protein GI: 284040377
COG category:
COG ID:
TIGRFAM ID:
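
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(6751317, 6753149)
print(length)                  # 1833
print(protein_length(length))  # 610
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of this record.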


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCAG AAACCACTTT TCTCGACCGT CAGCAGGAGT TCACTCAAAA AGAACAGGCT 
GCTCAAAGCA ACTACAATCA ACTGGCCTTC TGGCGGCTTA TCTGGTTTGT TGGTGCCGTT
GCCGGAGTTT GGCTGCTGGC CCGGTTCGAT CAGCAGCTAG CTGCGGCTGG CGTTTTGCTG
GTCGGCCTGA TTGGGTTTAT GCTGCTATTG AAAAAGCATC AGACCATTCG TCAGGAACGG
GATTTGTATC ACCAGTTGGC CTTTGTTAAT CAGGACGAAG TGGCCCGGCT AAAACGCCAG
TACCTGCGCC CGGAAACCGG CGAACAGTTC TCAAGCCCTA CTCACTCTTA TGCCGGCGAT
TTAGACGTAT TCGGCAAGCA CTCACTGTTC CGGCTGCTCA ATCGCACACA CACATACGAG
GGGCAGAGAC GTCTGGCAAA GTGGCTGAAG GCACCTTCTG CCCCAGACGC TATCCGATTG
CGTCAGGAGG CTGTAGCGGA GCTTAAACCG CAACTGGAAT GGCGTCAGCA GTTGGAAGCG
CTGGCCTATG CAGAGCCAAC CATCAACCAG TCGCCGGATG CCCTCGTGAA ATGGGCAACA
GCAGAAAGCG AGCCGCTAGC GGGTTACTTA TCGATTGTCC GTTTTCTCTT TCCCGCCATT
ACCATAGGTT TGTTTATCGG GTGGCTACTG GGTTACGTTC AGGGAGCCGC CGTGTTGCTG
GCACTGGCGG GGCATGGTCT TGTGCTCAGT CAAATCTCAG CGCGTGCTAA AGGGGTTAGC
GAGCAAACAT TCGAAATAGC AACGGCGCTG CGGGCGTATC AAGCGCTGCT TAAGCAAGCC
GAAGCGGTAA AGGGAGATAC TGTTCGATTG CGCGCCATCC GACAGGCGCT AACATCTGAT
ACTAAACTAG CTGCTTCGGC AGCCATTGGC CAGCTCGGAC GGCTTACCGA GGGGCTGAAC
TTCCGCCGAA ATCCTTATTT CGCCTTGCTG ATTGGTGTAG CAACACTTTG GGATATTCAC
TATTTGATAA AGCTTGAACA TTGGCGACAA ACGCATGGAC CGGCACTCAG TTTGTGGTTC
GAGGCACTGG GTGAGCTGGA AGCCCTCAAT AGCCTCTGTG GTTTCGCGTA CGCGCACCCG
TCCTATGCAA CTCCCGAAAT CGTTGATGAT AAGTTTGTAT TGGAATTAAC CTCGGCAGCC
CATCCGTTAC TAGCAGAAAA TAACAGCGTC GCTAACTCAC TTATTCTGCG TGGTGCCGGA
CAAACCGTCC TGATTACCGG CTCCAATATG TCGGGGAAAA GCACGTTTCT GCGGACGGTA
GGTACAAACG TAGTGCTGGC ATTAGCGGGG GGCGTGGTGC GTGCCGAACG CTTTCGGTGT
TCGCCCGTAC AGGTGTTTAC GAGTATGCGC ACACAGGACT CACTCGAAGA AAGCACATCG
TCGTTCTACG CCGAATTGAA ACGCCTGCAA ACGCTTATTG GCCTGACAAA CCCGGATAAG
TCGGCCTCAG TTTCTTCTAA AAATACCCTG CCTGTTCTCT ATTTTCTGGA TGAGATCCTG
AAAGGTACGA ACTCCGCCGA CCGCCATCGG GGCGCTGAGG CCCTTATTCG TCAGTTGCAC
CACACAATGG CATCTGGCTT TGTGTCTACC CATGATCTTG AGCTGGGTCA ACTTACCGAT
GCTGACGGCT TTGTGCGTAA CTACCACTTC CAGTCGGACC TTGTCAATGG CGAGCTTGTG
TTCGACTATA AACTCCGGGA TGGTATCTGC AAAAGTTTCA ACGCCAGCCA GCTGATGCGG
GCCATTGGCA TTGAGATGGA TGCGGTGAAA TAG
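
A GC content figure such as the one listed under Gene Information is the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted in the 10-base blocks used above; `gc_content` is a hypothetical helper, not the method the source database necessarily uses.

```python
def gc_content(seq):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGCCCCCAG AAACCACTTT TCTCGACCGT CAGCAGGAGT TCACTCAAAA AGAACAGGCT"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of the first line gives the value for the whole gene.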
 
Protein sequence
MPPETTFLDR QQEFTQKEQA AQSNYNQLAF WRLIWFVGAV AGVWLLARFD QQLAAAGVLL 
VGLIGFMLLL KKHQTIRQER DLYHQLAFVN QDEVARLKRQ YLRPETGEQF SSPTHSYAGD
LDVFGKHSLF RLLNRTHTYE GQRRLAKWLK APSAPDAIRL RQEAVAELKP QLEWRQQLEA
LAYAEPTINQ SPDALVKWAT AESEPLAGYL SIVRFLFPAI TIGLFIGWLL GYVQGAAVLL
ALAGHGLVLS QISARAKGVS EQTFEIATAL RAYQALLKQA EAVKGDTVRL RAIRQALTSD
TKLAASAAIG QLGRLTEGLN FRRNPYFALL IGVATLWDIH YLIKLEHWRQ THGPALSLWF
EALGELEALN SLCGFAYAHP SYATPEIVDD KFVLELTSAA HPLLAENNSV ANSLILRGAG
QTVLITGSNM SGKSTFLRTV GTNVVLALAG GVVRAERFRC SPVQVFTSMR TQDSLEESTS
SFYAELKRLQ TLIGLTNPDK SASVSSKNTL PVLYFLDEIL KGTNSADRHR GAEALIRQLH
HTMASGFVST HDLELGQLTD ADGFVRNYHF QSDLVNGELV FDYKLRDGIC KSFNASQLMR
AIGIEMDAVK
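
The relationship between the two sequences can be illustrated by translating the gene sequence codon by codon. The sketch below is an assumption-laden illustration rather than the database's own pipeline: it uses the amino acid assignments shared by the standard code and translation table 11, does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so none are needed), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order; assignments are the same in the
# standard code and in translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq):
    """Translate DNA (whitespace ignored) and stop at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence above.
print(translate("ATGCCCCCAG AAACCACTTT TCTCGACCGT"))      # MPPETTFLDR
print(translate("GCCATTGGCA TTGAGATGGA TGCGGTGAAA TAG"))  # AIGIEMDAVK
```

Both outputs match the first and last ten residues of the protein sequence listed above.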