Gene Slin_5663 details


Gene Information

Locus tag: Slin_5663
Symbol:
ID: 8729437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6898077
End bp: 6899864
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: RNA binding S1 domain protein
Protein accession: YP_003390427
Protein GI: 284040497
COG category:
COG ID:
TIGRFAM ID:
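
The length fields above can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence on this page ends in TAG). The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(6898077, 6899864)
print(length)                  # 1788
print(protein_length(length))  # 595
```

Both values agree with the Gene Length and Protein Length fields listed in the record.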


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.726551
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAA CGCAGCAACG CGAACTGCCG GCATTTGATT GGGACCGGGC AGACAACAAA 
GGGTTTGGAA GCGGCTATTC GGTTGAAGAA CACAACCGGA TGTTAGAACT TTACGACAAC
ACACTGTCGG AAGTTAAAGA GAAAGAAGTG GTAATGGGAA CCGTCGTTGG GATAACGGAT
CGGGAAGTAC TACTCAACAT CGGCTTCAAG TCGGATGGCT TAGTGCCAGC TTCTGAATTC
CGGGATATGC CGGACCTGAA GATGGGTGAT GAAATTGAAG TTTACGTAGA AAATCAGGAA
GACCCGAACG GTCAGCTGGT GCTTTCTCGC AAAAAGGCGA AAGTGATCAC TGCCTGGCAG
AAAATCCAGC GTGCTCTGGA CGAAGACCTC GTTATCGATG GTTTCGTTAA GCGCCGGACA
AAGGGTGGCC TGATCGTTGA TATTTTCAGC ATTGAAGCGT TCTTGCCAGG TTCGCAGATC
GACGTGAAGC CAATTCGCGA TTTCGACATC TTCGTTGGTA AGAAAATGGA GGTTAAAGTC
GTTAAGATCA ACTATGCAAA TGACAACGTA GTCGTTTCGC ACAAAGTCCT GATCGAGAAA
GACCTCGAAG CACAACGTGC ACAAATCCTG AACAACCTCG AAAAAGGTCA GGTACTGGAA
GGCGTTATCA AGAACATGAC CAACTTTGGT GTGTTCATCG ATCTTGGTGG CGTAGATGGT
CTGTTGCACA TCACGGATAT TTCGTGGGGT CGTATCAGCC ACCCATCCGA AGTACTGCAC
CTCGACCAGA AAGTCAACGT GGTTGTACTC GACTTCGACG AAGACAAGAA GCGTATTTCG
CTGGGCATGA AGCAACTTCA GGCTCACCCA TGGGATGCTC TGGTTGAAGA CATTCAGGTT
GGTTCGAAAG TGAAAGGTAA AATCGTGAAC GTAGCTGATT ACGGCGCGTT CCTCGAAATT
CAGCCTGGTG TTGAAGGCCT GATCCACGTA TCAGAAATGT CGTGGTCGCA GCACCTGCGC
AACCCACAGG AATTCCTGAA AGTTGGTGAC GAAGTAGAAG CACAAGTGCT GACGCTGGAC
CGTAACGACC GTAAAATGTC GTTGGGCATC AAACAACTGA CGGAAGATCC ATGGACTCGT
CCGGAACTGC GCACCAAATA CGCCGTTGGC ACCAAGCACA AAGGCATGGT ACGTAACCTG
ACAAACTTCG GCCTGTTCCT CGAACTGGAA GAAGGTATCG ATGGTCTGGT ACACGTGTCT
GACCTGTCGT GGACGAAGAA GGTGAAACAT CCTTCGGATT TCATTAAGGT TGGCGACGAA
CTCGAAGTGC TGGTACTTGA ACTGGATGTT GACAACCGTC GTCTGGCGCT GGGTCACAAG
CAACTCGAAG AAAATCCTTG GGATACGTTC GAAACCGTAT TCGCCGTTGG TACCGTACAC
CGTTGCACAA TTCTGAACAA GAACGACAAG ATGGCTACCC TCGAACTGCC GTATGGTATC
GAAGGTTTCT CGTCACTCAA GAATCTGGGC AAAGAAGATG GTACCTTCGC TGAAGTTGGC
GAAACGCTTG ACTTCAAAGT AACGGAATTC TCGAAAGAAG AGAAGCGTAT CATGCTCTCG
CACACGAAGA CGTGGCAGGA GAAGAACGAG CCAGTAAAAG AGCAGAAAGC ACCTAAGGCC
GCTCCGGCAA AATCGTCGTC CGCACCAGCT CAGGCTGAGC GTGGCGCTAC GCTGGGTGAT
CTTGATGCAC TGGCTGCATT GAAAGAGCAA CTGGAAGGCC GCAACTAG
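
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence can be used. The sketch below joins the blocks and computes a GC percentage. The helper names are hypothetical, and the demo input is only the first line of the sequence above, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block to check the whole gene.
first_line = "ATGAGCAAAA CGCAGCAACG CGAACTGCCG GCATTTGATT GGGACCGGGC AGACAACAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Run on the full sequence, the cleaned length can be compared with the Gene Length field and the result of `gc_percent` with the GC content field, which the page appears to report rounded to a whole percent.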
 
Protein sequence
MSKTQQRELP AFDWDRADNK GFGSGYSVEE HNRMLELYDN TLSEVKEKEV VMGTVVGITD 
REVLLNIGFK SDGLVPASEF RDMPDLKMGD EIEVYVENQE DPNGQLVLSR KKAKVITAWQ
KIQRALDEDL VIDGFVKRRT KGGLIVDIFS IEAFLPGSQI DVKPIRDFDI FVGKKMEVKV
VKINYANDNV VVSHKVLIEK DLEAQRAQIL NNLEKGQVLE GVIKNMTNFG VFIDLGGVDG
LLHITDISWG RISHPSEVLH LDQKVNVVVL DFDEDKKRIS LGMKQLQAHP WDALVEDIQV
GSKVKGKIVN VADYGAFLEI QPGVEGLIHV SEMSWSQHLR NPQEFLKVGD EVEAQVLTLD
RNDRKMSLGI KQLTEDPWTR PELRTKYAVG TKHKGMVRNL TNFGLFLELE EGIDGLVHVS
DLSWTKKVKH PSDFIKVGDE LEVLVLELDV DNRRLALGHK QLEENPWDTF ETVFAVGTVH
RCTILNKNDK MATLELPYGI EGFSSLKNLG KEDGTFAEVG ETLDFKVTEF SKEEKRIMLS
HTKTWQEKNE PVKEQKAPKA APAKSSSAPA QAERGATLGD LDALAALKEQ LEGRN
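
The protein sequence can be recomputed from the gene sequence by codon translation. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons it permits. That difference does not matter for this gene, which begins with ATG. The table is built by hand rather than taken from a library, and the function name is illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments
# (assumed to be shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed on this page.
print(translate("ATGAGCAAAA CGCAGCAACG CGAACTGCCG"))  # MSKTQQRELP
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to `translate` allows the same comparison over the whole record.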