Gene Slin_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3221 
Symbol 
ID8726974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3900765 
End bp3902573 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content46% 
IMG OID 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003388031 
Protein GI284038101 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.119299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0605848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAT TTGATTACAA GCAAGAGTTA GCCAAAGTAC CACACGAACC GGGCGTCTAC 
CGGTATTTTG ACGCAACGGG CGAGGTAATT TATGTTGGTA AAGCCAAAGA CCTGAAAAAC
CGGGTTAGTA GTTATTTTAC CAATTCAAAA GGGCACGATC GCAAAACCCT GCGGCTGGTA
AGCCAGATTC GAAAGATTGA GTTTACCATC GTCAACACTG AATTTGATGC CTTGCTGCTC
GAAAATCAGC TGATCAAGCG GTATCAGCCC AAGTTTAACA TTTTACTGCG CGACGATAAG
ACCTATCCGT TCGTATGTGT CACAAATGAG CACTTTCCGC GGGTTGTAAC GACCCGGCGA
ATCGACCGTA AACTCGGTAC TTTTTACGGC CCTTTCGCGA ACTTAAAGCC CATGTACACC
GTGCTGGATA TGTTCAGCCA GCTGTTTACG ATCCGAACGT GTAATTATAA CCTCGCTCCC
GAGAACATCG AAGCCGGGAA GTATAAAGTT TGTCTGGAAT ACCACATTGG TAATTGCAAA
GGCCCATGTG AAGGCAAACA GGCTGAAGAA GACTACAACT CAGATATTGA ACAGGTCCAC
CATATTCTGA AAGGCAACCT AAAGCCTGCT CAGGAGTACT TCAAGAACCA GATGGTTGAA
GCAGCCAATG ATCTGGCATT TGAGCAGGCA CAGAAGTATA AAGATAAAAT GGAAGTGCTG
CAGCGGTTTC AAAGTAAATC GACTGTTGTT AATCCGAAAA TTGCCGATGC GGATGTGTTC
TCCATTGCGT CAGATGAGGT TTCAGCTTAC ATCAACTTTA TGAAAGTGGT TAACGGAACC
ATCGTCCAGA CGCACACCGT AGAAATCAAG AAAAAGCTCG ACGAAACGGA CCAGGACTTG
ATGGCTATGA TGATCATTGA GTTTCGGGAT CAGTATGGCA GTCAGGCAAA GGAAATTATA
TCGAATATAC CTCTCGATGT TGATTTAAAA GCGGAGGTAA CCGTTCCGCA GATTGGCGAC
AAAAAGAAAC TGCTCGATAT GTCCCTTAAA AACGTGCTTT ATTTCCGGCG CGAAAGGCAG
GAGCGAGCAG CCGCTGAAGC AACGGCCAAT GCCAGTAAAA AAGATCGTGT GTTGATCCGG
CTGAAACAGG ATTTGCAGCT AAAAACATTG CCGAACCGTA TTGAATGCTT TGACAACTCA
AACATTCAGG GCACAAATCC TGTATCGGCA ATGGTATGTT TTATTGGTGG AAAACCCGCG
AATAAAGAGT ACCGCCACTT TTCTATTAAG ACTGTTATTG GGCCAAACGA CTTCGCAAGT
ATGTATGAAG TCGTTACACG ACGGTATACA CGCGTTTTAA CGGAAGATAC CGGCCTTCCT
GACCTGATTG TCATTGATGG TGGCAAAGGC CAGCTCAGTG CCGCCTGCGA CGCGTTAAAA
GACCTCGATC TATATGGTAA AGTGCCAATT ATCGGTATTG CCAAACGGCT TGAAGAGATT
TACTTTCCGG AAGACAACTT ACCACTCTAC ATCGATAAAA AGTCCGAGTC GCTCAAACTT
ATCCAGCGCA TACGCGATGA GGCTCACCGG TTTGCTATTA CCTATCACCG GGATAAACGC
AGCCGCAACA GCCTGATCAG TGAACTGGAG AATGTAGAAG GGGTCGGCAA GAAAACAGCG
GCCAAGCTTT TGAAGCATTT TAAAGGCGTC ACCAAAATTC GGGAGGCCAG CTTTGATGAA
GTGGCCGAAG TTGTGGGTAA AGACCGTGCG GTTAAGCTAA AACAGTATTT TGACACTATT
GAACAATAA
 
Protein sequence
MPEFDYKQEL AKVPHEPGVY RYFDATGEVI YVGKAKDLKN RVSSYFTNSK GHDRKTLRLV 
SQIRKIEFTI VNTEFDALLL ENQLIKRYQP KFNILLRDDK TYPFVCVTNE HFPRVVTTRR
IDRKLGTFYG PFANLKPMYT VLDMFSQLFT IRTCNYNLAP ENIEAGKYKV CLEYHIGNCK
GPCEGKQAEE DYNSDIEQVH HILKGNLKPA QEYFKNQMVE AANDLAFEQA QKYKDKMEVL
QRFQSKSTVV NPKIADADVF SIASDEVSAY INFMKVVNGT IVQTHTVEIK KKLDETDQDL
MAMMIIEFRD QYGSQAKEII SNIPLDVDLK AEVTVPQIGD KKKLLDMSLK NVLYFRRERQ
ERAAAEATAN ASKKDRVLIR LKQDLQLKTL PNRIECFDNS NIQGTNPVSA MVCFIGGKPA
NKEYRHFSIK TVIGPNDFAS MYEVVTRRYT RVLTEDTGLP DLIVIDGGKG QLSAACDALK
DLDLYGKVPI IGIAKRLEEI YFPEDNLPLY IDKKSESLKL IQRIRDEAHR FAITYHRDKR
SRNSLISELE NVEGVGKKTA AKLLKHFKGV TKIREASFDE VAEVVGKDRA VKLKQYFDTI
EQ