Gene Slin_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3901 
Symbol 
ID8727659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4679951 
End bp4681060 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content51% 
IMG OID 
Productproline-specific peptidase 
Protein accessionYP_003388690 
Protein GI284038760 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0490645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGGG TTGCCCGGCA GGTCACAAAT TCAAATCAAT TCAACTCTAA AATCCCCTTG 
AAAACCAATC GCTTCCCTTT ACTTCTGGCC GCTTTTAATA TCGTGCTCAT TAGCCTGACC
GTTACAGGCT GTAACCCAAC TACAAACGGT AGCGAGGCTA ACACGCGCCA AACCTACTTT
ACCCCGGCCG ACACGGGTGT GCAAACCGGG GGCGTGACCG TAATCCCGAT CAAGACACCT
AAAGGCACAT TCAACGTATG GACAAAGCGG ATTGGCAACA ACCCGAAAAT TAAGGTGTTG
ATTCTGCACG GGGGGCCCGG TGTCAACCAT GACCCCTACG AGTGTTTCGA GAATTTTCTG
CCCAAAGAAG GCATTGAGTT CATTTATTAC GATCAGCTCG GCGCGGGCAA CAGCGACCGA
CCAACTGACA AGAGCCTGTG GGTGTTACCC CGTTTTGTGG AAGAAGTAGA GCAGGTTCGG
ATGGCGTTGG GGTTAAACAA AGACAACTTT TACCTATACG GTCAGTCGTG GGGGGGCATT
TTGGGTATTG AATACGCGCT TAAATACGGC CAAAACATCA AGGGTTTAAT TATCTCGAAC
ATGATGAGCA GCGCGCCCGC CTACAGCCAG TACGCCACCG ACGTACTCGC CAAACAAATG
GATCCGAAGG TGCTGGCCGA GATCAAAACC CTTGAAGCAA AAGGCGACTT CACCAACCCG
CGCTATATGG AACTGTTGCT GCCCAATTTT TACGAAAAGC ATATCTGCCG GTTTCCAACG
GCGCAGTGGC CCGAACCGGT GAATCGGGGG TTGGCCAAAC TGAACCAGGA GCAGTATGTG
ACTATGCAGG GACCAAGCGA GTTTGGCATG GCGGGTGATG CTAACCTAAA GAACTGGGAT
CGTACCAAAG ACCTGCCCAA AATCACAGTG CCGACGCTTG TTATCGGCGC CACCTACGAC
ACGATGGACC CCAAACACAT GGCGATGATG GCCAGACAGG TCAAAAATGG CACTTTCCTG
CTCTGTACCA AGGGTAGCCA TCTGGCGATG TACGACGACC AGCAAACGTA TTTCACCGGA
TTGATCTCTT TTTTAAAAAA AGGGAATTGA
 
Protein sequence
MARVARQVTN SNQFNSKIPL KTNRFPLLLA AFNIVLISLT VTGCNPTTNG SEANTRQTYF 
TPADTGVQTG GVTVIPIKTP KGTFNVWTKR IGNNPKIKVL ILHGGPGVNH DPYECFENFL
PKEGIEFIYY DQLGAGNSDR PTDKSLWVLP RFVEEVEQVR MALGLNKDNF YLYGQSWGGI
LGIEYALKYG QNIKGLIISN MMSSAPAYSQ YATDVLAKQM DPKVLAEIKT LEAKGDFTNP
RYMELLLPNF YEKHICRFPT AQWPEPVNRG LAKLNQEQYV TMQGPSEFGM AGDANLKNWD
RTKDLPKITV PTLVIGATYD TMDPKHMAMM ARQVKNGTFL LCTKGSHLAM YDDQQTYFTG
LISFLKKGN