Gene Slin_5953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5953 
Symbol 
ID8729734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7216446 
End bp7217807 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content51% 
IMG OID 
Productpeptidase M20 
Protein accessionYP_003390714 
Protein GI284040784 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCT ACCTGAAAAC AAATAAGCAG CGGTTCCTGG ACGAACTGCT GGAGTTATTG 
CGAATTCCAT CGGTTTCGGC CGATTCAAAT TTTAAAGGCG ATGTTCGTCG TGCCGCCGAG
TTTGTAAAAG ATAAGCTACA GGCGGCTGGT CTGGACAACG CCCAACTTTT CGAGACGCCG
GGCCATCCGA TTGTCTATGC CGAAAAATTA GTTGATCCGG CAAAGCCAAC GGTGTTGGTG
TATGGCCATT ACGACGTACA ACCCGCTGAC CCGTATGAGT TATGGCATAC GCCCCCGTTT
GAGCCAACCA TCCGCAACGA GCGGATTTAT GCCCGGGGTG CCTGCGATGA CAAAGGCCAG
TTTTATATGC ACATCAAAGC CATTGAAGCC ATGGTGGCAA CGGATGGTTT GCCGTGCAAT
GTAAAAGTGA TGATTGAAGG AGAAGAGGAG GTAGGTTCCG ACCATCTGGG CATCTTCGTC
GCCAACCACA AGGAAATGCT TAAGGCCGAT GTTATTCTGG TGTCTGATAC AAGCATCATC
TCGAACGAAA CACCGTCTCT GGAAACTGGG CTACGTGGCC TGTCGTACGT GGAGGTACAC
GTAACGGGCG CTAACCGCGA TCTGCATTCG GGTGTGTATG GGGGGGGGGT CGCTAATCCG
ATCAATGTAT TGTGCGAGAT GATTGCGTCC TTGCACGACG AGAAGGGGCG CATCACCATT
CCCGGCTTTT ACAACAATGT AGCTGATCTG AGCGATGAGG AACGGGCCGA ACTCGCCAAA
GCGCCTTTCG ACCTGGAGGA GTACAAGCGC GATCTGGGCA TTAACGATGT GATGGGTGAG
GCTGGCTATT CGACCAACGA ACGTACGTCT ATTCGCCCAA CACTCGACGT AAACGGTATT
TGGGGCGGCT ACATCGGCGA AGGCGCAAAA ACGGTATTGC CTTCTAAGGC GTCGGCTAAA
ATCAGTATGC GTCTGGTGCC GAACCAGACA CCCGACGAAA TTACCGAACT CTTTACCAAT
CACTTTCTGT CCATCGCTCC TTCCGGCGTT ACTGTAACGG TAGAGCCACA TCATGGCGGT
ATGCCTTATG TGACACCCGT TGATTCCGTT GAATTTGAAG CCGCCAGCAA AGCCTTTGAA
GATGCCTGGG GGAAAAAGCC AATACCGACC CGAGGGGGCG GCAGTATTCC TATTATGGCC
CTTTTTGAAC AAGAACTGGG CATCAAGTCG ATTCTGATGG GTTTTGGCTT AGATAGCGAC
GCGCTTCATT CGCCCAATGA AAGTTATGGA CTGTTTAACT TCTACAAGGG AATTGAAACA
ATCCCGTATT TCTATAAACA TTACGCTGCC CTCAAACAGT AA
 
Protein sequence
MTTYLKTNKQ RFLDELLELL RIPSVSADSN FKGDVRRAAE FVKDKLQAAG LDNAQLFETP 
GHPIVYAEKL VDPAKPTVLV YGHYDVQPAD PYELWHTPPF EPTIRNERIY ARGACDDKGQ
FYMHIKAIEA MVATDGLPCN VKVMIEGEEE VGSDHLGIFV ANHKEMLKAD VILVSDTSII
SNETPSLETG LRGLSYVEVH VTGANRDLHS GVYGGGVANP INVLCEMIAS LHDEKGRITI
PGFYNNVADL SDEERAELAK APFDLEEYKR DLGINDVMGE AGYSTNERTS IRPTLDVNGI
WGGYIGEGAK TVLPSKASAK ISMRLVPNQT PDEITELFTN HFLSIAPSGV TVTVEPHHGG
MPYVTPVDSV EFEAASKAFE DAWGKKPIPT RGGGSIPIMA LFEQELGIKS ILMGFGLDSD
ALHSPNESYG LFNFYKGIET IPYFYKHYAA LKQ