Gene Slin_0314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0314 
Symbol 
ID8724042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp403032 
End bp405032 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content52% 
IMG OID 
ProductX-Pro dipeptidyl-peptidase domain protein 
Protein accessionYP_003385177 
Protein GI284035247 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.820883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTGGTAG CCTATGTGAT GCATATCGCT ATCTTTACAG ATATCAACTC CACTACTACC 
CATCAATACC TCATGAAACA CTGGCTTTAT GCGTTGCTTT CAATTGCTTT CATCCTTCCC
GCACACGGTC AACAAACCAC GACCGGCAAC TTCGTTCGCG ATAACTACCA GAAATTCGAG
TACAAGATTC CTATGCGCGA CGGTACCAAA CTGCATACGT CGGTCTACGT TCCTAAGGAT
GCATCGGCTA CGACGAAGTA CCCATTTCTG ATGCAGCGGA CGTGTTATAG CGTTGCGCCT
TACGGTGCCG ATGCTTACCC GGCACAGGTG GGGCCATCGG GAACGCTCAT GCACCAGAAA
TTTATCTTTG TGTATCAGGA TGTACGGGGT CGCTGGGCGT CGGAAGGCAC CTGGACCAAC
ATGACGCCCA ACCTGCCCGA CCCTCCGGCA TCTGCCAAAA AAGCGGGCAA GAAAGGGCAG
ACAACCGTTT CAGCGGCAGG CTCGCCCGAT GAAAGTTCTG ATACCTACGA TACCATTGAG
TGGCTGCTTA AAAACGTACC CAATAACAAC GGTCGGGTAG GGCAGTGGGG CATTAGTTAT
CCGGGCTTTT ACACGGCTGC ATCTCTACCC GACGCGCACC CCGCCCTGAA AGCCGCATCA
CCACAAGCCC CTGTCTCCGA CTTTTTCTTC GATGATTTTC ACCATAACGG GGCCTTCATT
CAGGCTTACC TGTTTACGTT TCCCGTTTTT GGTGTGCAGC ATCCCGAGCC AACCACCAAA
GCCTGGTATA ACGATCAGAT GATTCAGACG GGCACGAAAG ACGGTTTCCA GTGGCAGTAT
GACCTGGGGC CGCTGAAAAA TGCCGATAAA TACTACAAGG ATAATTTTTA CTGGCAGGAA
ACCGTCAACC ACCCCAATTA CGATGCGTTC TGGCAGAAAC GGAGTATCCT GCCACATCTG
AAAAACGTTC GTCCGGCCGT TATGACGGTG GGCGGCTGGT TCGATGCCGA GGATTTATAC
GGTCCGCTGA ACGTGTATAA AACGATTGAG AAGAGCAGTC CCGGTGCCTA TAATACGCTG
GTGATGGGGC CGTTCGGACA CGGCCGGTGG TCGCGCGAAA CGGGCCATAC GCTGCATAGC
AACGTCTATT TTGGCGATAG CATCGCTACG TTCTACCAGC GGAACATCGA ATCGAAGTTT
TTCACGCATT TCCTGAAAGG AGCAGGCGAT GGAAAAAGTG GTTTGCCCGA GGCTTATCTC
TTCAATACCG GCCGTAACGA GTGGAAAACG TTCGACAAAT GGCCCGTAGC CGACGCTTCC
CAAAAACAGT TTTTCCTGAT GTCGGATGGT ACCCTGGCCA CTAACCGGAC GATGGAGGTT
GGCAGCAATC GGTTCTCCGA ATTCATCAGC GACCCGGTTA AACCCGTGCC GTATACGGAA
GATATTACCA CCACGCAGGG CTTTACGCCG TTCAATTATA TGTCGGAAGA CCAGCGGTTT
GCGGGCCGAC GGCCGGATGT GCTCACCTTC CAGACGGAGG TGCTGACCGA AGATTTAACG
CTGGGTGGCG AAATCATGGC GAAGCTCAAA GTGAGTACAA CCGGCACCGA CGCAGACTGG
GTGGTCAAGC TCATTGACGT GTACCCGCCC GATGAACCTA ATCATCCGTA TATGCCTAAC
CGGAACATTA CGCTGGGCAA TTACCAGCAG ATGGTGCGGT CGGAGGCTAT GCGCGGGCGG
TTCCGCAATT CGTTCGAGAA GCCGGAGCCG TTTAAACCCG GCGAGGTGAC TGACGTGAAT
TTCCGCCTTC AGGATGTGCT GCATACGTTC AAAAAAGGCC ACCGGGTCAT GATTCAGGTG
CAGAGTACGT GGTTCCCGCT CATCGACCGG AATCCGCAGA AATACGTGGA GAATATATTT
AAAGCGGATG TTGCCGATTT TCAGAAAGCA ACGCATCGCG TGTACGACAA TTCAGTAATT
GAAGTGCAGG TGTTGAAATA A
 
Protein sequence
MLVAYVMHIA IFTDINSTTT HQYLMKHWLY ALLSIAFILP AHGQQTTTGN FVRDNYQKFE 
YKIPMRDGTK LHTSVYVPKD ASATTKYPFL MQRTCYSVAP YGADAYPAQV GPSGTLMHQK
FIFVYQDVRG RWASEGTWTN MTPNLPDPPA SAKKAGKKGQ TTVSAAGSPD ESSDTYDTIE
WLLKNVPNNN GRVGQWGISY PGFYTAASLP DAHPALKAAS PQAPVSDFFF DDFHHNGAFI
QAYLFTFPVF GVQHPEPTTK AWYNDQMIQT GTKDGFQWQY DLGPLKNADK YYKDNFYWQE
TVNHPNYDAF WQKRSILPHL KNVRPAVMTV GGWFDAEDLY GPLNVYKTIE KSSPGAYNTL
VMGPFGHGRW SRETGHTLHS NVYFGDSIAT FYQRNIESKF FTHFLKGAGD GKSGLPEAYL
FNTGRNEWKT FDKWPVADAS QKQFFLMSDG TLATNRTMEV GSNRFSEFIS DPVKPVPYTE
DITTTQGFTP FNYMSEDQRF AGRRPDVLTF QTEVLTEDLT LGGEIMAKLK VSTTGTDADW
VVKLIDVYPP DEPNHPYMPN RNITLGNYQQ MVRSEAMRGR FRNSFEKPEP FKPGEVTDVN
FRLQDVLHTF KKGHRVMIQV QSTWFPLIDR NPQKYVENIF KADVADFQKA THRVYDNSVI
EVQVLK