Gene Slin_3352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3352 
Symbol 
ID8727105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4055969 
End bp4057765 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content51% 
IMG OID 
Productpeptidase M61 domain protein 
Protein accessionYP_003388161 
Protein GI284038231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.780487 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.430349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCT TACGCTTGTT GTTGCTGGCG TCCAGCACAC TCCTTTTTCA ATTCTCGACG 
GCCAAACCCA CTACGCCCCT GGTGTATGAA GTGAACCTCA ACGACCGCGC CGACGATCAG
TTTAAAGTGA CGCTGCGCGT TAGCGGCCTG ACAGCCGCCA ATGCCGTTTA CCAGTTTGCA
TCCACTGCCC CCGGTACCTA TCAGGTGATG GATATTGGCC GCTATGTTCG ATCCTTCAAA
GCATTCGACG CCAAAGGGCG CGAACTCAAA ACGCAGCAGG TATCGACCAA CCAGTGGCAG
TTTGAGAAAC CCGAAAATGT GCGGACTGTC CAGTACAGCA TCGCCGAAAC CTGGGACACA
CCCGTTAACG AACACAAGCC GTACAACATG TGCGGTACGT CTATCGAAAA AGATCACGTG
CTCATTAACG GACAGGGCGT ATTTGGCTTC CCCACCGGTA TGCAGGACGC ACCCATCGAC
GTAAAACTCA ATTACCCAGC CGAATGGTCG GTGGGAACAG CGCTGGAGAA AAACGCAAAA
GGGTACTTTA CGGCGGCCAA TTATGACCGG ATAGTGGATT CGCCTATCTT GCTGGGTCGC
CTGACCAAAG CCACGACAAC TGTGGCCGGA GCCCAGATCG ACGTGTACAC CTATTCAAAA
AGCGATAAGA TTAAGTCCGA TCAACTGCTC ACGAATATGC AGTCCATGCT CAATGCCGCC
GGTCAGTTTT TGAAGCAGTT GCCCGTAAAA CGGTATACCT TTCTGTATCA CTTCGAAGAT
CAGGACTGGG GCGCGTGGGA GCACTCCTAC AGTTCTGAAT ACGTGATCAA AGAAGAGGAA
TTCTCGAAAA AGCTGGCCGA CAATATGACC TCCATAGCCG CCCATGAATT CTTCCACGTC
GTCACACCCC TCAACATCCA TAGCGAAATT ATCCAGCAGT TTAACTTCGT AACGCCCACC
CCCTCCGAAC ACCTGTGGCT GTACGAAGGC GTAACCGAAT GGGCCAGCGA TGCCATGCAG
CTGCGGGGAC AGATTATGGA TTTGCCGACT TATTTCGAGG AACTGAGCCA GAAAATAGCG
TACGACAAAA GCTTGGACAC AACCTATAGC CTGAGTAAAC TAGGCCTGAC TTGCTACACC
GACGAAGGGC AGCGGCAGTA CGGCAACATT TACGCCCGGG GCGCTCTGGT AGCCGGTTTG
CTGGACATTC GCCTGCTCGA ACTGTCGAAC GGAAAACGTG GGTTGCGGGA AGTAATCAAT
GAACTAGCCA CAACCTACGG CCCTAATCAG GCCTTTCCCG AGAAAGAGTT CTTTGCCATT
TTCACCCAGA AGACGTATCC CGAAATCGCC GATTTCTTCA ACCGATACGT GAAAGCGACC
GAGCCTTTAC CCTTCAGCGA CTATTATGGA AAATTAGGAA TCACTTACAT GCCCAGTGTA
AATACGGGTC AGAAAGCACC CTACATGGGT CTGGGAGCCG GTTTTATCGA TAATAAGTTC
TTGCTGACAA GCCTGAGCGA CTCCCTACGC AAAGCGGGTT TGCAGGAAAA AGACGAGTGG
GTTGCCTATA ATGGGCAACC TGTAACGCTG GAAACGTTCA GTGACATTCA ACATGAGTTG
AAGAAGCAAA AGGTGGGCGA TGTGTATGAG CTGACTGTCA GACGAAATGG GCAGGAACTG
AAAATTAAAA GCACGGTTCA GGAAAAAGAA CTCGTTCAAA AATATAAATT TCAACTCGAT
CCACAGGCAA CACCCCAGCA GATTAAACTA CGGGAAGTCT GGCAGCAGAA TCTGTAA
 
Protein sequence
MKILRLLLLA SSTLLFQFST AKPTTPLVYE VNLNDRADDQ FKVTLRVSGL TAANAVYQFA 
STAPGTYQVM DIGRYVRSFK AFDAKGRELK TQQVSTNQWQ FEKPENVRTV QYSIAETWDT
PVNEHKPYNM CGTSIEKDHV LINGQGVFGF PTGMQDAPID VKLNYPAEWS VGTALEKNAK
GYFTAANYDR IVDSPILLGR LTKATTTVAG AQIDVYTYSK SDKIKSDQLL TNMQSMLNAA
GQFLKQLPVK RYTFLYHFED QDWGAWEHSY SSEYVIKEEE FSKKLADNMT SIAAHEFFHV
VTPLNIHSEI IQQFNFVTPT PSEHLWLYEG VTEWASDAMQ LRGQIMDLPT YFEELSQKIA
YDKSLDTTYS LSKLGLTCYT DEGQRQYGNI YARGALVAGL LDIRLLELSN GKRGLREVIN
ELATTYGPNQ AFPEKEFFAI FTQKTYPEIA DFFNRYVKAT EPLPFSDYYG KLGITYMPSV
NTGQKAPYMG LGAGFIDNKF LLTSLSDSLR KAGLQEKDEW VAYNGQPVTL ETFSDIQHEL
KKQKVGDVYE LTVRRNGQEL KIKSTVQEKE LVQKYKFQLD PQATPQQIKL REVWQQNL