Gene Slin_6233 details


Gene Information

Locus tag: Slin_6233
Symbol:
ID: 8730016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 7558201
End bp: 7559421
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: protein of unknown function DUF262
Protein accession: YP_003390991
Protein GI: 284041061
COG category:
COG ID:
TIGRFAM ID:
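
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and an unspliced CDS whose terminal stop codon is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(7558201, 7559421)
print(length, protein_length(length))  # 1221 406
```

Under these assumptions the result matches the 1221 bp and 406 aa given in the record.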


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATCTA TGCTTCTACC AGGTGAAAAA CTCAATTTTA GCGCTTTAGA GAATCTTGAC 
AAAAGACTGT CTAGTACGGA TGAAGAGTCA AATGAAAAAT ACGAAAAAGG GGATGTCCGT
ATTGTAACAG AACAAGCAAG GTACCCTCTT GTTTCGGTTG TCTCTATGGT AACAAGTTCA
GATTACGAGT TGAATCCAGA ATTTCAAAGA AGACACCGGT GGGATAAAAC TAAGCAATCT
AGATTGATAG AATCTTTCAT TATGAATGTC CCTATTCCAC CAATCTTTCT TTATGAAGTA
AAGTATGCAA AGTATGAAGT CATGGATGGA CTACAAAGAC TCACAGCTAT TCACGATTTT
TATAGTGACA ATTTAGTATT AGAAGGATTA GTTGAGTGGC CTGAACTTAA TGGCAAAAAA
TATTCACAGT TACCAACTCA AATAAAAAGT GGCGTAGATA GAAGATACCT ATCCTCAATT
ATATTACTAC AAGAAACTGC AAAGACATAT GACGAAGCAC AAAGATTAAA ACAACTTGTG
TTTGAAAGAA TTAATAGTGG GGGGGTTACT TTGGAACCTC AAGAAGCTCG AAACGCTATA
TACAATGGTA ACCTAAACCA ACTTTGTATC AAGCTAGCAC GCAATGAATA TCTCTGCAGA
ACATGGGATA TACCCTTACC TACTCCTGAA GAAATAAATA CAGGGGAAGT TAGTGATGAG
CTTCTAAAAA ACGAATTTTT TAGAAAGATG TCTGACGTTG AGTTAGTACT GCGTTTTTTC
GCATTCAGAC AAAGACCTGA AAATCCGAAA GGAACTCTAA AAGAATTTTT AGATACGTAC
TTAAAATACG GCAATAATTT CTCTATAGAA TTGCTCTCCC AAATGGAAAT CATCTTCAAC
GACACGATAT CGCTCGTTTA TGAAATTTTG GGCAAGAAAG CTTTTAGACT ATGGAGACAA
AGAAAGAAGC AATCAAATCA ATCATGGGAC TGGTATAATA GACCAGGTAT GACTGCTTAT
GACCCTATAA TGTATGTATT CAGTCAAAAC TTGGGAAGAC GGGAAGAAAT AATAAGCAAA
AAAGAGCAAA TAAATAGTAA TATAACAAAA TTTTATGAAG ATAACTACAT AAGCTTTGGA
GGTCATTCGA CTACTAATAT TAGTAGTATA AACGACCGTA ACAAGGTTTT TGCTCAGTTT
TTGGACAATA TTTTAGAATA G
 
Protein sequence
MISMLLPGEK LNFSALENLD KRLSSTDEES NEKYEKGDVR IVTEQARYPL VSVVSMVTSS 
DYELNPEFQR RHRWDKTKQS RLIESFIMNV PIPPIFLYEV KYAKYEVMDG LQRLTAIHDF
YSDNLVLEGL VEWPELNGKK YSQLPTQIKS GVDRRYLSSI ILLQETAKTY DEAQRLKQLV
FERINSGGVT LEPQEARNAI YNGNLNQLCI KLARNEYLCR TWDIPLPTPE EINTGEVSDE
LLKNEFFRKM SDVELVLRFF AFRQRPENPK GTLKEFLDTY LKYGNNFSIE LLSQMEIIFN
DTISLVYEIL GKKAFRLWRQ RKKQSNQSWD WYNRPGMTAY DPIMYVFSQN LGRREEIISK
KEQINSNITK FYEDNYISFG GHSTTNISSI NDRNKVFAQF LDNILE
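
The sequences above are displayed in space-separated blocks of ten characters across several lines, so they need to be joined before use. The sketch below shows one way to do that, along with a GC fraction calculation for a nucleotide sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input string is a toy example rather than the gene itself.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(nucleotides: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not nucleotides:
        return 0.0
    gc = sum(1 for base in nucleotides if base in "GC")
    return gc / len(nucleotides)


# Toy example in the same block layout as the record.
toy = "ATGCGGCCAT ATAT\nGG"
seq = clean_sequence(toy)
print(seq, len(seq), gc_fraction(seq))
```

Pasting the full gene sequence in place of the toy string should give a length that can be compared against the gene length stated in the record.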