Gene Slin_4394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4394 
Symbol 
ID8728154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5328484 
End bp5330211 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content51% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003389174 
Protein GI284039244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGAG ATGGTAACGC TATGAAAACG CCGGGCATGC CTGATGGTCA GGACAATAAG 
AACCGGATTC AGAATGATAA AGCAACCGTG CGTATACCCA TGCTGCTGGG CATTGCACTC
GCAGGGGGCA TGTTAATTGG GGCGACGTTC TTTGGTGGCA CCCAGAGCAT GAATAATATC
GGACGGGGAT ACAGCAAATA CAAGGAGATT CTGCAACTCA TCGAGAACAA CTATGTCGAT
ACTGTCAATA CCGACGATCT GGTCGATTAC TCGATTACCA AAATGCTGGA GAAGCTCGAT
CCGCATACGG CCTACATGAA CCCGCAGGAT GCCGTGGCCG CCCGGTCGCA GCTGGAAGGT
GGATTTGACG GCATTGGCGT TGAGTTCAAC ATTTACAAAG ACACCGTTTA TGTAGTAACG
CCCCTGGCCG GTGGCCCCTC CGAAACGGCG GGTATCCAAA GTGGCGACAA GATCATTAAG
GTCGATGATA AACCCCTGGC CGGTGGCAAA ATAGAAAACA GCGCCGTGTT TAAAGCCCTG
CGCGGCAAAC GCGATACGAA TGTTAAATTG ACTATTCTGC GAAAAGGCGA CAAGCAACCC
AAGGAGTTTA CGATTACACG GGGCCGTATT CCGACGTACT CGGTCGATGC TGCCTATATG
ATTGATGCCA AAACCGGTTA CATCAAAATA AACCGGTTCT CGGAAACAAC GTATGACGAG
TTCAAGACGG CGCTGGCGTC GCTCAAGGCA AAAGGGATGT CGCAGTTGAT GATGGATCTG
CGGAATAACC CCGGTGGATA CATGGACCGG GCTACCAACA TTGCCGACGA ATTTATTTCG
GGCAACAAAC TGCTCGTCTA TACCGATGGC AAAGACAACC GGTACGACCG TAAAACGATG
GCCCATATTG CAGGGCAGTT TGAAGAAGGT GCCCTGGTTG TGCTTATCGA CGAAGGCAGT
GCATCGGCTT CTGAAATTGT GTCGGGTGCA TTGCAGGATC ACGACCGTGC CTTGATTGCT
GGTCGGCGGT CGTTCGGGAA GGGGCTGGTA CAAATGCCGG TAACCCTGTC TGACGGTTCC
GAACTGCGTC TGACCATTTC GCGCTATTAT ACACCCAGCG GCCGTAGTAT TCAGAAACCG
TACGTGCCGG GTCAGGAGGG CGATTATGAA AAAGACCTCG AACTGCGCTC AAAGCGGGGT
GAGTATTACA TTGCCGATTC GATCAAAAAC GACCCCAAAC TGAAGTTTAA AACCGACGGC
GGACGGGTTG TATACGGCGG TGGCGGTATC ACGCCGGACT ACTTCATTCC CCGGGATTCA
ACCTGGCAGA CGGCGTATCT GGTGCAACTT TACGGCAAGA GTATTATCCG CGAGTTTGCA
ATGGAATATG CCAATGACAA CCGAAAGAAA CTGGAAAAAA TGTCGTTTGA AGAGTTCGAT
AAGGCGGTTA CTATCAACGA TGAGCAGATG AACCGTTTGG TGAAAGACGC TACGGCGGAG
GGCATCAAGT TCAACGAGAA AGAGTACAAC CGCTCTAAGA ACTACATCCG TACGCAGATA
AAAGCGCTGG TAGCCCGGTC TATTTTCCAG AAGAACAACA AGGGGGGGCA AAACAATGAA
TTCTTCCGAA TCATTGCCCA GACGGACGAC ACTTATCAGA AGGCACTGAA ACTCTTTGAT
CGGGCCAACA AGCTCGAACA TGGAGCGATG ACGTATAATC AGAAGTGA
 
Protein sequence
MDGDGNAMKT PGMPDGQDNK NRIQNDKATV RIPMLLGIAL AGGMLIGATF FGGTQSMNNI 
GRGYSKYKEI LQLIENNYVD TVNTDDLVDY SITKMLEKLD PHTAYMNPQD AVAARSQLEG
GFDGIGVEFN IYKDTVYVVT PLAGGPSETA GIQSGDKIIK VDDKPLAGGK IENSAVFKAL
RGKRDTNVKL TILRKGDKQP KEFTITRGRI PTYSVDAAYM IDAKTGYIKI NRFSETTYDE
FKTALASLKA KGMSQLMMDL RNNPGGYMDR ATNIADEFIS GNKLLVYTDG KDNRYDRKTM
AHIAGQFEEG ALVVLIDEGS ASASEIVSGA LQDHDRALIA GRRSFGKGLV QMPVTLSDGS
ELRLTISRYY TPSGRSIQKP YVPGQEGDYE KDLELRSKRG EYYIADSIKN DPKLKFKTDG
GRVVYGGGGI TPDYFIPRDS TWQTAYLVQL YGKSIIREFA MEYANDNRKK LEKMSFEEFD
KAVTINDEQM NRLVKDATAE GIKFNEKEYN RSKNYIRTQI KALVARSIFQ KNNKGGQNNE
FFRIIAQTDD TYQKALKLFD RANKLEHGAM TYNQK