Gene Slin_0934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0934 
Symbol 
ID8724664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1121154 
End bp1123298 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content53% 
IMG OID 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_003385785 
Protein GI284035855 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTG GACTAACCCT TCTTGCCCTA ACGACTGTGC TCACCACGTT TACATCAGGT 
GTTGATTCAC CAACAGCACC CGACCCCAAT CCGTTTTTCA GTACGTACAA TACGCCATTT
GGAGTGCCGC CCTTTGATCA GATCAAGCCC GAGCACTTTG AACCGGCCAT CGAAGAAGGC
ATACGGCAGC AAACGGCCGA AATCGAAACC ATTACCAAAC AGAAAGCAAC TCCCACCTTT
GCCAACACGG TTGAAGCGCT GGAAGCCAGC GGAGATTTGC TTCGGCGGGT CAACACCGTA
TTGGGGAACC TCAACGGTGC CAACACCAAC GATCAGCTAC AGAAAATTGC TCAGACAGTA
GCCCCAAAAC TGGCTAAACA CAGCGATGAC ATTATGCTTA ATCCGGCCTT GTTCGGGCGG
GTAAAAGCGG TTTACGACGG TCGCGCCAAA CTAAAATTAT CCGGCGATCA GCAGCGTTTG
CTCGAAAAAA TGTACAAGAA CTTTGTGCGA AACGGAGCCG CCCTGACCGC CGACAAGCAG
ACCCGTTTGC GCCAGATCAA CGGCGACGTA TCGGTACTGA CGCTGAAATT CGGGCAGAAT
CTGCTCGCCG AAAACAATAC CTATGCCCTG ATCATCGACA AGGCCGACGA CCTGAGCGGT
TTACCAGCCT CGGTCGTAGC CGCAGCCGCC GAAGAAGCAA AAAAGCGAAA ACTGACAGGC
AATAAGTGGG TTTTCACACT TCAGAATCCA AGTATCATGC CGTTTCTGCA ATACGCCGAT
AACCGGGCTC TGCGCGAGCA ACTGTTGAAG GCGTACCTCG AACGGGGTAA CCACAACGAC
GAACACGACA ACAAAGCGAT CATGGCCAAC ATGGTGTCGC TGCGAGCCGA AAAAGCGCAG
TTGCTCGGGT ACGACAACCA TGCAGCTTAC GTGCTGGAAG AAAGCATGGC CCAAACGCCC
GACAAAGTGG CTCAATTACT GAATCAACTC TGGTCGGCAA CGGTGCCCGT AGCGAAGCAG
GAAGCCGCCG ACTTACAGGC TATGCTGGAC AAAGACGAAA AGGCCAATCC CGGCCGGAAC
CAGAAATTGG CCAGCTGGGA CTGGCGCTAT TACGCCGAAA AACTGCGGAA AGAAAAATAC
GCCCTCGACG AGCAGGAACT ACGTCCTTAT TTTTCGCTGG AAAGTGTACG GGATGGCATT
TTCATGCTAA CCAACCGGCT GTATGGCCTG CGTTTCGAAC CCCGTACCGA TATTCCGGCC
TACCACGAAG AAGCCACGGC CTACGAGGTT AAAGAAGCCG ACGGGCGGCA CATCGGGATC
ATTTACATGG ATTTTTTTCC AAGAGCCAGC AAACGCGGGG GTGCCTGGAT GACCAGCTAC
CGGCGGCAGG AAGTAGATAA TGGCAAAAAA ATAGCGCCCG TTGTATCCAT CGTCTGCAAC
TTCTCCCGGC CATCGGGCGA CGCCCCTGCC CTGCTCACCT TAAACGAAAC CAGCACCTTT
TTCCACGAAT TTGGTCATGC CCTGCACGGG CTGCTGTCCA ACGTCCATTA CGGGAGCCAG
TCGGGTACGT CGGTGCCGCG CGATTTCGTT GAACTACCCT CGCAGATTAT GGAAAACTGG
GCTGTAGAGC CGGAAATGCT TCGGTTGTTT GCCAAACATT ATAAAACCGG GGCCGTCATT
CCCGACGCAC TGGTCGACAA GATCAAGCGG AGCAGCCTGT TTAATCAAGG CTTCGAAACG
AGTGAGTATT TGGCCGCTTC GCTGCTCGAC ATGGCCTATC ACACGCTGAA ACCGGACCAG
ACACCGACTG ATGTGCTGGC TTTTGAAAAA CAGGCAATGG ACAAAATTGG CTTGATCGAC
CAGATTCCAC CGCGCTACCG GAGTACCTAC TTCCAGCACA TTTTTTCGGG CGGTTACTCG
GCAGGGTATT ACAGCTATAT CTGGTCGGCG GTGCTCGATG CAGATGCTTT TGAGGTGTTT
AAGCAAAAAG GACTCTTCGA CCCTAAATCG GCTCAGTCGT TCCGCAAAAA CGTCCTTGAA
AAAGGCGGCA CTGAGGATCC AATGACGCTT TATCGCAAAT TCCGGGGTGC CGAACCCGAC
ATCAAGCCGC TGCTTCGCCG GCGTGGGCTC ATGAAGGATG TTTAG
 
Protein sequence
MNIGLTLLAL TTVLTTFTSG VDSPTAPDPN PFFSTYNTPF GVPPFDQIKP EHFEPAIEEG 
IRQQTAEIET ITKQKATPTF ANTVEALEAS GDLLRRVNTV LGNLNGANTN DQLQKIAQTV
APKLAKHSDD IMLNPALFGR VKAVYDGRAK LKLSGDQQRL LEKMYKNFVR NGAALTADKQ
TRLRQINGDV SVLTLKFGQN LLAENNTYAL IIDKADDLSG LPASVVAAAA EEAKKRKLTG
NKWVFTLQNP SIMPFLQYAD NRALREQLLK AYLERGNHND EHDNKAIMAN MVSLRAEKAQ
LLGYDNHAAY VLEESMAQTP DKVAQLLNQL WSATVPVAKQ EAADLQAMLD KDEKANPGRN
QKLASWDWRY YAEKLRKEKY ALDEQELRPY FSLESVRDGI FMLTNRLYGL RFEPRTDIPA
YHEEATAYEV KEADGRHIGI IYMDFFPRAS KRGGAWMTSY RRQEVDNGKK IAPVVSIVCN
FSRPSGDAPA LLTLNETSTF FHEFGHALHG LLSNVHYGSQ SGTSVPRDFV ELPSQIMENW
AVEPEMLRLF AKHYKTGAVI PDALVDKIKR SSLFNQGFET SEYLAASLLD MAYHTLKPDQ
TPTDVLAFEK QAMDKIGLID QIPPRYRSTY FQHIFSGGYS AGYYSYIWSA VLDADAFEVF
KQKGLFDPKS AQSFRKNVLE KGGTEDPMTL YRKFRGAEPD IKPLLRRRGL MKDV