Gene Slin_3346 details

Gene Information

Locus tag: Slin_3346
Symbol:
ID: 8727099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4046461
End bp: 4047822
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: peptidase M20
Protein accession: YP_003388155
Protein GI: 284038225
COG category:
COG ID:
TIGRFAM ID:
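
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4046461
end_bp = 4047822
gene_length_bp = 1362
protein_length_aa = 453

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not
# contribute an amino acid to the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, codon_count, expected_protein_aa)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus one matches the listed protein length.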


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0581129
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGAG TACTTCTTCT CCCTGCTCTG GCATTCGCGT CTCTCCTTTA CCTTTCTGAA 
CCACTTCAGG CGCAGCAGAA AACATCAAAA ACAACAGCTA CTTCCGGGGT CGAGAAACGA
TACGTTGACG AAATAGGCAC CCTTGCCAGT CAATCTTCCG TAAAAAAAGC GTTTCAGTTT
TTCGTTGATC TGGAACCCCA GACCATGAAG GATCTGATCA ACCTAACCGA AACACCTTCT
CCGCCCTTCA AGGAAACCGT ACGGGCTAAA AAATACGCGG CCATGCTGAA AGAGGCCGGA
GCCGACTCGG TCTGGATCGA CGAGGTGAGC AACGTGATCG CCAAACGAAA AGGGCGGAAG
GGCAGCAAAA CAGTAGTCAT TGAATCGCAC CTGGATACCG TTTTTCCCGA AGGCACAGAT
GTAAAAGTCA AGTATAAAGG CGACACCCTT TATGCACCGG GCGTAGGCGA CGACACCCGC
GGCCTGACCG CCATTCTGGC CGTTCTGAAA GGTATGGAAG CTGCTTCTAT CGAAACAGAT
GCCGACGTAC TGTTTGTGGG GGCTGTGGGT GAAGAAGGCC TGGGCGACCT GCGTGGAGTG
AAGCACCTGC TCCGAAAAGG CGGCCCCAAG GTCGACTCCT ACATTGCCGT AGATGGCGAC
GGCATCAGCA GCATCGTGCA CCGGGGACTA GGCTCGCATC GGTACCGAAT TACCTTTAAG
GGGCCGGGCG GGCATTCGTA CGGCTCGTTT GGCATTGTTA ACCCGCACAG CGCCCTCGGA
AAAGCCATTT ACTACTTTAC GACTGAAGCC GATAAGGTGA CGCGGCAGGG CGTTAAGACA
ACCTATAGCG TCAGTGTGAT CAACGGCGGC ACGTCTGTCA ATGCGATTCC CTACGAGTCG
TGGATGGAGA TCGACATGCG CTCCGAAAGT CCAGAGAAAC TCAATGAAGT CGACCAGTTG
TTACAGGCGG CCGTTCAGCG GGCGTTAAAC GAAGAGAACG GCATCAAACG GCAGGGACCG
GATTTGACGG TAGATGTCAA AAAGATCGGG GATCGACCAT CGGGTAAAAC CGATGCATCG
GCGGCTATTG TTCAACGGGC CATGGCCGCT ACCAGCTATA TGAACGTGGC TCCCCAACTG
GACGTAGCTT CAACAAACGC TAACACGCCC ATTGCGCTGG GGATTCCGGC GGTTACCATC
GGGAGTGGCG GCACTGGCGG GGGCGAACAC TCGCTGAACG AGTGGTGGCT GAACGACAAA
GGTTACCTGG GTATGCAGCG CATTCTGTTG GTTCTTCTGG CCGAAGCTGG ACTGGATAAA
GGAGCAGCAG CCTCGAAGGT GAAAGCGGAG AAACGGCGGT AA
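
The GC content field of this record could in principle be recomputed from the gene sequence above. The sketch below is illustrative only and is not the database's own method. The function name gc_fraction is hypothetical. It assumes that the spaces and line breaks in the listing are layout only, and that any character other than A, C, G or T should be ignored.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases among the A/C/G/T bases."""
    # Keep only unambiguous bases; this drops the spaces and newlines
    # used to group the sequence into blocks of ten.
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage: paste the full gene sequence between the quotes. Only the
# first row of the listing is shown here to keep the example short.
first_row = "ATGAATCGAG TACTTCTTCT CCCTGCTCTG GCATTCGCGT CTCTCCTTTA CCTTTCTGAA"
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

A single row is not expected to reproduce the whole-gene figure; the comparison with the listed percentage is only meaningful when the complete sequence is supplied.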
 
Protein sequence
MNRVLLLPAL AFASLLYLSE PLQAQQKTSK TTATSGVEKR YVDEIGTLAS QSSVKKAFQF 
FVDLEPQTMK DLINLTETPS PPFKETVRAK KYAAMLKEAG ADSVWIDEVS NVIAKRKGRK
GSKTVVIESH LDTVFPEGTD VKVKYKGDTL YAPGVGDDTR GLTAILAVLK GMEAASIETD
ADVLFVGAVG EEGLGDLRGV KHLLRKGGPK VDSYIAVDGD GISSIVHRGL GSHRYRITFK
GPGGHSYGSF GIVNPHSALG KAIYYFTTEA DKVTRQGVKT TYSVSVINGG TSVNAIPYES
WMEIDMRSES PEKLNEVDQL LQAAVQRALN EENGIKRQGP DLTVDVKKIG DRPSGKTDAS
AAIVQRAMAA TSYMNVAPQL DVASTNANTP IALGIPAVTI GSGGTGGGEH SLNEWWLNDK
GYLGMQRILL VLLAEAGLDK GAAASKVKAE KRR