Gene Slin_2037 details


Gene Information

Locus tag: Slin_2037
Symbol:
ID: 8725775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2461389
End bp: 2463245
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: peptidase M61 domain protein
Protein accession: YP_003386881
Protein GI: 284036951
COG category:
COG ID:
TIGRFAM ID:
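
The length fields above are consistent with the listed coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields.
start_bp = 2461389
end_bp = 2463245

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1857 bp
print(protein_length, "aa")  # 618 aa
```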


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.129512
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA ACGCCCGGCT TCTGTATTTA CTTACTGTCT GGACATTTTT CACTCCATTT 
TCATCAACTG CTACCCCAAC ACTTGCCGAC GAAGAAAACC CACCGGCAGC CACGCCTACC
GTAACGTATG TGCTCTCGAT GCCTGAGCCG CAGACGCATT ATTTCGAGGT GGAAATGCAA
TTGAAGAATG TGGCCGCTGC CACCAATGCG AAGAAAAATG GCTATGTAGA CATCAAAATG
CCGGTTTGGA CCCCCGGTTC TTACCTGATT CGTGAATACG CCAAAAATGT AGAAGCGTTT
ACGGCATCGG CTGGCGGCAA GACGGTTCCC AGCGAAAAAA TCCGGAAAAA TGCCTGGCGT
GTCCAGTCGA CCGACGATAA CCTGACCATC AAATACCGGG TATACGCCAA TGAGCTGACC
GTTCGGACGA GTTTTGTCGA TGCCGATCAC GGTTACGTAA CCCCCGCCGG TGTGTTTATG
TACCACGATG CCTTGAAAAA CATCCCCCTG CGCGTGGTTG TACAACCGTA CAAAGATTGG
AAAAACGTGG CGACAGGCCT CGAACCCGTT GCAGACCAGT CGTATACCTA CGAAGCTGCC
GATTTCGATA TTCTGGTCGA TTCCCCCATC GAGATCGGTA ATCATACCAC GTTCACCTTC
ACGGCGTCGA ATGTTCCGCA TTCGGTATCC ATGTTTGGCG ATGTAAATTA TAATGAGAAG
CAACTGGCGG CCGATTATAA GCGGGTTTGC GAAGCAGCCG CAACCGTGGT TGGCGAGCAT
CCCTGCAAAC ACTACACCTT TATTGTTCAT CATATCCCGC CGGGTGGGGG AGGACTCGAA
CACCTGAACT CCACAACCCT CGAAACGACC CGTAACGCCT ATTCGACCGA AGCGAACTAC
AAGCGGTTTC TGTCGCTGGT AGCCCACGAG TACTTTCACC TCTGGAATGT AAAACGAATT
CGCCCCGTTG CGCTGGGTCC GTTCGATTAT GAGAACGAGA ATTACACGCA CATGCTCTGG
CTGTCGGAAG GCTGCACGTC GTTCTATCAG GAATATATCC TGCGAAGGGC GGGTTTTCAC
TCGCCTGAAT CCTACCTCAA TCTGGTGGCC AGCAGCATAA CGGATATTGA GAACCAGCCC
GGTACGCGGG TGCAGTCGGC GGCTGAGTCC AGCTGGGATG CCTGGATCAA AGGGTACCGG
CCCAATGAAA ACTCCAGCAA CACCACAATT TCGTATTACA GCAAAGGCAG TGTGCTGGGC
ACATTGCTGA ATCTGGCTAT TCTGGGCGGA AGCAACGGGG AGCGCAATAT GGACGATCTG
ATGCGGCTTC TGTATTCGGA ATATTACAAA AAACAGAAAC GTGGGTTTAC GGACGATGAG
TTTCGTAAAG CAGCCGAGCA GGTGGCCGGT CGTAAACTCG ATGACTTTTT TAACATCGGT
GTTAACAGTG CCGAACCAAT TAATTACAAT ACGTACTTAG AGCCGGTGGG TATGCAGTTG
ATCAACGTTG CGGCCAGAAC GCAGGATGGT TTTCTGGGTG CTGCCACGAC GGTGGCAAAC
GGGAAAGCGA CTATTTCATC CGTTCGGCGC GGGTCGGCGG CTTACCAGGA TGGCCTTAAT
GTGGGTGATG AAGTTATTTC GGTAGATGGG TTCCGGGTTG GCGATGATCT GCTTCGGTTT
GTCAGCGGCC GTCGGGTGGG TGATAAACTG CTTGTGCTGG TTAACCGGGC GGGACAACTT
CGCGAAATTC CCGTTACCCT CACCCAGAAT CCGCTGGCCA GTTACCGGAT CGAGCCATTG
AGCAACCAGA CAGCTGCCCA AAAAGCGTTG TATGCGAAAT GGTTATATAT TAAATGA
 
Protein sequence
MKKNARLLYL LTVWTFFTPF SSTATPTLAD EENPPAATPT VTYVLSMPEP QTHYFEVEMQ 
LKNVAAATNA KKNGYVDIKM PVWTPGSYLI REYAKNVEAF TASAGGKTVP SEKIRKNAWR
VQSTDDNLTI KYRVYANELT VRTSFVDADH GYVTPAGVFM YHDALKNIPL RVVVQPYKDW
KNVATGLEPV ADQSYTYEAA DFDILVDSPI EIGNHTTFTF TASNVPHSVS MFGDVNYNEK
QLAADYKRVC EAAATVVGEH PCKHYTFIVH HIPPGGGGLE HLNSTTLETT RNAYSTEANY
KRFLSLVAHE YFHLWNVKRI RPVALGPFDY ENENYTHMLW LSEGCTSFYQ EYILRRAGFH
SPESYLNLVA SSITDIENQP GTRVQSAAES SWDAWIKGYR PNENSSNTTI SYYSKGSVLG
TLLNLAILGG SNGERNMDDL MRLLYSEYYK KQKRGFTDDE FRKAAEQVAG RKLDDFFNIG
VNSAEPINYN TYLEPVGMQL INVAARTQDG FLGAATTVAN GKATISSVRR GSAAYQDGLN
VGDEVISVDG FRVGDDLLRF VSGRRVGDKL LVLVNRAGQL REIPVTLTQN PLASYRIEPL
SNQTAAQKAL YAKWLYIK
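
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the tables differ in their permitted start codons). The `translate` helper and the variable names are hypothetical. Only the first line of the gene sequence is used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in the record.
first_line = "ATGAAAAAAA ACGCCCGGCT TCTGTATTTA CTTACTGTCT GGACATTTTT CACTCCATTT"
print(translate(first_line))  # MKKNARLLYLLTVWTFFTPF
```

The result matches the first 20 residues of the listed protein sequence (MKKNARLLYL LTVWTFFTPF).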