Gene Slin_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1858 
Symbol 
ID8725595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2241175 
End bp2243304 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content51% 
IMG OID 
ProductOligopeptidase A 
Protein accessionYP_003386702 
Protein GI284036772 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.375638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGTCC CTCAAATCAA GAAGACCGGC GTTTTACTGG CGATCACTGC TACCACAGTG 
TTGGCCCAGC CTTCGCCCCA ACAACCCAAG ATGACCCAAA ATCCATTTCT CACGTCGTAC
ACGACACCGC ACCAGACGGC GCCGTTCGAC AAGATCAAGA ATGCCGATTA TCTCCCCGCG
CTTAAAGACG GTCTGGCACA AGGCCGCAAA GACGTGGATG CAATCGTTAC TAATACCGCT
GCGCCAACCT TTGAGAACAC CATCGTTGCC CTCGAACGTG CGGGCGACCT GCTTGGTAAA
GTAACGTCGG TGCTGTTTAA TTTGAACAGT GCAGAAACAT CTCCTGAGCT TCAGAAAATT
GTAAAGGAAG CGTCGCCCCT GTTGAGTGAG TATGGGAATG ATATTACCCT GAATGCAAAA
CTGTTTGCCC GAATCAAAAC CGTGTACGAG CAGCGTGCCA AGCTGAAACT TGATCCGGAA
AGTGCTATGT TGCTCGAAAA AACGTACAAA CGCTTTTCGC GTAACGGGGC CAATCTGGAC
GATAAAGGCA AAGAGCGCCT GCGGGTTATC GACAAAGAGT TGTCGCAGTT GTCGCTGCAA
TTTGGGGAGA ATGTCCTGAA CGAAACCAAC GAATACCTGA TGGTGGTGGC GGACGAGAAA
GACCTCGCCG GGTTGCCCGA CTTCGCGCGC GATGCGGCTA AAGCAACGGC CAAACAGAAA
GGAAAAGAAG GCTGGGTGTT TACCCTGCAG GCCCCGAGCT ATGGCCCTTT CATGCAGTAT
GCCGACAATC GGGAGTTGCG TAAGAAACTT TATCTGGCTT ATAACGGACG GAGTTTTCAC
GGCGATAAGA ACGACAATTC GGCCATCATC AACAAGATTG TAAACCTACG CTTCGAGCGC
GCTAATCTGT TGAATTACAA AACCCACGCT GACTTTGTGC TGGAAGAAAG TATGGCTGGT
TCGAGAGATA AAGTCCAGAG TTTCCTGGAA GAGCTGGTAT CGTATGCACG TCCGGCAGCC
GAGCGTCAGC TTGTAGAACT AACGACCTAT GCCAAAGCCC ACGGTTTCCA GGATGATAAA
TTACAGGCAT GGGATAGTGG TTATTATTCG GAAAAGCTGA AGAAAGAGAA GTACGATCTT
GACGATGAAA TGCTGAAACC GTACTTCAAA CTGGAGAATG TCCTGAACGG GGTCTTCACC
ATAGCAAACA AGTTATATGG TGTCACGTTC AAGGAGCGTA CCGATATTCC GGTGTACAAC
CCGGAGGTAA AAACCTTCGA CGTTTTTGAT AAAGACGGCA AATTTCTGGC AGTTTTTTAC
GGTGATTACT TCCCACGGGC GGGCAAGCGG AGCGGGGCCT GGATGAACGA CATTCAGGGG
CAGAAAATCG AGAACGGGAC CAACATCCGG CCGCATATCA TTAACGTCTG TAACTTCACC
CGCCCCACGG ATACCAAGCC TTCGCTGCTG ACATTCTATG AGGTGACAAC CCTCTTCCAC
GAATTCGGGC ATGGCCTGCA TGGTATGCTG GCGAATGGTA AATACGAAAG TCTGAGCGGC
ACCAGCGTTC CCCGCGATTT CGTTGAGTTA CCCTCGCAGG TAATGGAAAA CTGGTGTTAC
GACCCCGAAG CACTCAAGCT TTTTGCGAAG CACTACCAAA CGGGCGAAGT GATTCCGAAT
GAACTGATTG AAAAGATCCG CGCCAGCCAG AACTTCATGG CAGGTATGGC TAATCTGCGT
CAGTTGCGTT TGGGCCTGGT CGATATGTAC TATCACGGCC AGAAACCAAC CGGCGAAACG
ATCTCGCAGG TTGAAGGTCG GGTAGATTCT GTTGCGAACC TGTTCCCGCA CGTGGATGGG
GTGGCGATCA GCCCGGCTTT CTCGCACATT TTTGCGGGTG GCTACTCGGC CGGCTATTAC
AGCTACAAAT GGAGTGAAGT GCTGGATGCC GATGCTTTTG AGTTCTTTAA AGAAAAAGGT
GGCCTCGAAA ACAAAGCAGC TGCCGACAGT TTCCGCAGGA ATGTACTGGA AAAAGGCGGT
AGTGAAAAAC CTATGGAACT CTACAAAAAA TTCCGGGGCC GCGAACCATC GCCCAAAGCC
ATGCTCCGTC GCAGTGGATT GATATTGTAA
 
Protein sequence
MLVPQIKKTG VLLAITATTV LAQPSPQQPK MTQNPFLTSY TTPHQTAPFD KIKNADYLPA 
LKDGLAQGRK DVDAIVTNTA APTFENTIVA LERAGDLLGK VTSVLFNLNS AETSPELQKI
VKEASPLLSE YGNDITLNAK LFARIKTVYE QRAKLKLDPE SAMLLEKTYK RFSRNGANLD
DKGKERLRVI DKELSQLSLQ FGENVLNETN EYLMVVADEK DLAGLPDFAR DAAKATAKQK
GKEGWVFTLQ APSYGPFMQY ADNRELRKKL YLAYNGRSFH GDKNDNSAII NKIVNLRFER
ANLLNYKTHA DFVLEESMAG SRDKVQSFLE ELVSYARPAA ERQLVELTTY AKAHGFQDDK
LQAWDSGYYS EKLKKEKYDL DDEMLKPYFK LENVLNGVFT IANKLYGVTF KERTDIPVYN
PEVKTFDVFD KDGKFLAVFY GDYFPRAGKR SGAWMNDIQG QKIENGTNIR PHIINVCNFT
RPTDTKPSLL TFYEVTTLFH EFGHGLHGML ANGKYESLSG TSVPRDFVEL PSQVMENWCY
DPEALKLFAK HYQTGEVIPN ELIEKIRASQ NFMAGMANLR QLRLGLVDMY YHGQKPTGET
ISQVEGRVDS VANLFPHVDG VAISPAFSHI FAGGYSAGYY SYKWSEVLDA DAFEFFKEKG
GLENKAAADS FRRNVLEKGG SEKPMELYKK FRGREPSPKA MLRRSGLIL