Gene Slin_6149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6149 
Symbol 
ID8729930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7455289 
End bp7456830 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content49% 
IMG OID 
Productpeptidase S41 
Protein accessionYP_003390908 
Protein GI284040978 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.451646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCCA TCGGCTGTCT ACTCCTTTCG TTCTTGTTTT CGTTTAGCGG CCACGCCCAG 
CCAGCCATCG ACTCACTGAA AACCCGGGTC CTGCAACCGT CCGCTATGCA GGCCGATTTT
CGCTACCTCC GCAAGCTGCT GGAAGAAACA CATCCGGGTT TATATCGCTA CACGCCCAAA
GCAATCATGC AGGCAAAGCT GGACAGTATT GCCGGTACGC TGACAAAACC GCTTCCGTTT
TATAAATTTT ACGGGACCAT TGAAGCTCTT ATGGCCGACA TCCGCTGTGC CCATACGCAT
GCATTACCGG AGAAAAACTG GCGCAATCAA TTCAATAAAG TCCAGAAAAC GAACCCATTT
TTCATATGGT CTACTCAACA GCGCTTCTTC GTCCTGATGA ACGGCACGAC CGACCAAACC
ATCAAACCCG GCTTTGAACT ACTTAGTATT AACGGCCAGT CGATGGACGA TATCCGACAG
CAAATGGACC GACACCATTG GGCCGATGGC TACATCCAAT CATCGAAAAG TCAGATGCGG
GGTGAATTTT TTGACTTGTT CTATTATTGG TTCGTTGGCC AGCCAGATAC GTTTTCGTTC
AAATTTCGCA GTCTGACGGG CGACACAGTT CAAGTGAATG CCGAAGCGAA ACCCTACCGC
GTGTCGTTGC GGCAAATGCT CAAAAATCCC GTCAACAAGC AAATGGTAGC CTGGTATGTC
AACAAAAAAC AGAAACACCC CTGGCGCCTG TCGTTCCCCG ATACGCTGAC GAACACCGCT
ATTCTTCGAT TCGACGGATT CGGTGGAGAG GGAGCAAGAA ACAGTACCGA AGCCGTGACC
GTCTTTCGGG CATTTATGGA TAAGAGTATG GATAAACTTA AAAAGCAACG AACAAAGCAT
TTGGTCATTG ATGTCAGAGG TAATACGGGT GGGTGGGACA GTCAGGGTAT CGAGTTATTT
ACCTATCTGA TGAAAACGGA TTCAGCCGTA CCCTATCACA CTCGCCAGCA TAGCATTAGC
GATGGCACTA ATGGCAGTGA GTTTCTCCAA TTTTCGGACC TCTCCGAAGC CAACCGCAAA
AACATAAAGA ACGAGTTAAT CCCTGAGGCC GATGGTACGT TTACCCTTAA ACAGGCCAGC
GACACTGATT CGACGGGCCG AACCCCCAAA CGATATACTC CTAAGCCCAA TCGGTTCAAG
GGACAAGTTT ATTTGCTGAT GAATGGAGAA AGTGCCTCAA CGGCGTCGGA GTTTCTGGCG
GTTGCTCATG CCAACAATGT GGGGGTGTTT ATCGGTACAG AATCCGGGGG CGCGTATGAA
GGGGGGAACG GGGGTAGTTT TATTACCCTT GAACTGCCCA GGTCAGGTAT ACAGGTAACA
ACACCGCTGG TGTACTACAA CAATGCCGTA CCTGAACCGA AGCAGAAAGG GCGCGGCACA
CTGCCGGATT ACTACGTGCC CGTTACAATA AATGATTTAC TACTGCACAC CGATTCACAA
TTTAATTTTG TCGTAACCTT GATTCGGAAG CAACCTCAAT GA
 
Protein sequence
MKAIGCLLLS FLFSFSGHAQ PAIDSLKTRV LQPSAMQADF RYLRKLLEET HPGLYRYTPK 
AIMQAKLDSI AGTLTKPLPF YKFYGTIEAL MADIRCAHTH ALPEKNWRNQ FNKVQKTNPF
FIWSTQQRFF VLMNGTTDQT IKPGFELLSI NGQSMDDIRQ QMDRHHWADG YIQSSKSQMR
GEFFDLFYYW FVGQPDTFSF KFRSLTGDTV QVNAEAKPYR VSLRQMLKNP VNKQMVAWYV
NKKQKHPWRL SFPDTLTNTA ILRFDGFGGE GARNSTEAVT VFRAFMDKSM DKLKKQRTKH
LVIDVRGNTG GWDSQGIELF TYLMKTDSAV PYHTRQHSIS DGTNGSEFLQ FSDLSEANRK
NIKNELIPEA DGTFTLKQAS DTDSTGRTPK RYTPKPNRFK GQVYLLMNGE SASTASEFLA
VAHANNVGVF IGTESGGAYE GGNGGSFITL ELPRSGIQVT TPLVYYNNAV PEPKQKGRGT
LPDYYVPVTI NDLLLHTDSQ FNFVVTLIRK QPQ