Gene Slin_6234 details

Gene Information

Locus tag: Slin_6234
Symbol:
ID: 8730017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 7560248
End bp: 7561588
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: Peptidase M23
Protein accession: YP_003390992
Protein GI: 284041062
COG category:
COG ID:
TIGRFAM ID:
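
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 7560248
end_bp = 7561588

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")      # compare with the Gene Length field
print(protein_length_aa, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the reported Gene Length (1341 bp) and Protein Length (446 aa).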


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTAA ACGACTTACC CCGTTCTCTT CAGGTTGGGT TGCTGATCGG CTTGGTCAGC 
GCCTGTACCG GTATCGGTCC AACGGCCAGT TTGTTCCGGC CATCGTCGCC CCGCGAGCAA
TACGGGCAAT CCCTCAAACA GGCAAAGCTG GACCAAACGG CCCTCGGCAC CGACTGGCTG
TCGGCGGGTG AGCGTGCCCT CCAGGATTCG CTCAAGATCA CGGTTCCGTA CCGCGAAAGC
GGTTATTTCT CGGCCAGCAA AGCCTTTGCC GTAGGCTATC GGCTGGAGGC ACAGCGGGGC
GACCGGTTTC TGGTAAAAGT CGAAACCCAA GGTCAGAAAG AAACACAGGT GTTTATCGAT
GTATTCGTGC TCGAAAGTCG TGGTAAAAGC AGCCTGGTAG CGGCTTCCAA AGCAGATACT
AATGTGCTCG CCTGGGAACC CCGACGAACG CAAACGTACC TCATCCGCAT TCAACCCGAA
TTGCTGCGAA CGGGGAGCTA CACGATCTCT ATCACGCGCG AACCGGCCCT GAGCTTTCCC
GTTAAGGGGC GCGACAGCCG ACAGATCAGC AGCTTCTTCG GCGTGGCGCG GGATGGAGGT
CGCCGACGGC ACGAAGGGGT CGATATTTTT GCGCCCCGGG GCACACCCGC CCTAGCCTCG
GTCGATGGCG TTATTTCGGG GGTAGGGGTT AGCAAACTGG GCGGCAACGT AGCGTTTCTG
ACCGACAATG ATCGCAATAT CCGGCTGTAT TACGCCCACC TCGACCGCTG GAATGTTACC
AACGGGCAGC GCGTATCCAT TGGCGACACC GTTGGCTTCG TCGGTAATAC CGGTAACGCC
CGTACCACCG GCCCACATTT GCATTTCGGT ATTTATGGTT TTACGGATGG CGCCACCGAC
CCCCTGCCGT TTATCCGGAT GGGGCGTGGT CCGGCCAAAC AATCGCTGCT GTCGGCGAAC
CGGCTTGGCG ATTCGGTGCG GGTATCTGCG GCCAAGTCTG TCGTTCGATT GTCGCCCGGA
AGCGAGGGCG TGGCGCTTCG TGAGGTACCG AAGGCAACGG CACTGACCAT CCTTGGCGGT
ACTGAATCCT GGCTGCGGGT CGAGTTGCCG GATGGGCTTA TCGGCTATGT CGCCAGTAGC
GCCACCGAAG CCGAAAAACG CCCGCTCCGG CGTCTGGTAT TGCCCACATC AAAGCCACTG
CTGGACGCGG CCTATGCACA AGCTGCCACC ATCACCACTT TGCCAACCGG TGCTGCCCTT
GAGGTACTGG CCACAGCCGA TGCCTTTCAG CTGGTTCGAA ATGAAGCGGG GCAAACAGGT
TGGGTAATGG TGGGGCCGTA A
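
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the database's own method: the helper name gc_percent is hypothetical, and it assumes the sequence is pasted in the 10-base block layout used above, so whitespace is stripped before counting.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGGAATTAA ACGACTTACC CCGTTCTCTT CAGGTTGGGT TGCTGATCGG CTTGGTCAGC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of the first row gives the value to compare against the GC content field.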
 
Protein sequence
MELNDLPRSL QVGLLIGLVS ACTGIGPTAS LFRPSSPREQ YGQSLKQAKL DQTALGTDWL 
SAGERALQDS LKITVPYRES GYFSASKAFA VGYRLEAQRG DRFLVKVETQ GQKETQVFID
VFVLESRGKS SLVAASKADT NVLAWEPRRT QTYLIRIQPE LLRTGSYTIS ITREPALSFP
VKGRDSRQIS SFFGVARDGG RRRHEGVDIF APRGTPALAS VDGVISGVGV SKLGGNVAFL
TDNDRNIRLY YAHLDRWNVT NGQRVSIGDT VGFVGNTGNA RTTGPHLHFG IYGFTDGATD
PLPFIRMGRG PAKQSLLSAN RLGDSVRVSA AKSVVRLSPG SEGVALREVP KATALTILGG
TESWLRVELP DGLIGYVASS ATEAEKRPLR RLVLPTSKPL LDAAYAQAAT ITTLPTGAAL
EVLATADAFQ LVRNEAGQTG WVMVGP
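
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons; this gene begins with ATG, so no special handling is included). The function name translate and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with unrecognized bases become 'X'.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGAATTAA ACGACTTACC CCGTTCTCTT"))
print(translate("TGGGTAATGG TGGGGCCGTA A"))
```

The two sample calls reproduce the first ten residues (MELNDLPRSL) and the last six residues (WVMVGP) of the protein sequence shown above.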