Gene Slin_3001 details

Gene Information

Locus tag: Slin_3001
Symbol:
ID: 8726752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3633217
End bp: 3635247
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_003387811
Protein GI: 284037881
COG category:
COG ID:
TIGRFAM ID:
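
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3633217
end_bp = 3635247

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2031
print(protein_length_aa)  # 676
```

Both results agree with the Gene Length (2031 bp) and Protein Length (676 aa) reported in the record.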


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0623369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAA ATAATAATAG AAATCCGTTA GTACCCCGTG GTGGGCCACG TAAACCCAAT 
TTCCAGGGCT GGATTGTTGC GTTGCTGATT GCCGCTATTC TTGGCATTAC GTTCTTTAAC
AGAAGTTCGG CTACCCAGGA AATCTCGCAG AAGCAGTTTG AGCGTATGGT GAAAGATCAT
GAGGTCGCCG AGGCTATTGT CGTTAATGAC AAGATCGCCG AAGTGACGCT TACGCAACAA
GCCGCTCAGA GTCCCAGATA CAAGAATCGA TTTGCCGATA AGCCCTATTT TGGGAACAAC
CAGGGACCCC ATTTTCAGTT TCAGATTGCC TCTGGCGAAA CGTTCAAAAA GGATTTGGAC
CAACTTCAGC AAGGGCAACC CGACAATGAA AAAATCGACC TTAAGTTCGA AACCAGAAGT
GACTTTGGCA GCATCATCAG TACCTGGGGT TTCCTGATCG TGATGATACT GGCCATGTAT
TTCCTGCTCG GACGTATGTC TGGAGCCGGT GGGCCTGGAG GGCAGATTTT CAACATTGGA
AAATCTAAAG CTGCCCTGTT CGATGCTGAT AACAAGGTAA AGATCACGTT CAACGATGTT
GCCGGCCTTG ACGAAGCAAA GGAGGAGATC AAAGAAATTG TTGATTATCT CAAGAATCCA
ACCAAGTTCA CGAAGCTGGG TGCAAAAATT CCTAAAGGTG CTTTGCTCAT AGGCCCTCCG
GGTACAGGTA AAACCCTGCT GGCAAAAGCC GTTGCTGGCG AAGCGGGTGT TCCCTTCTTC
TCCCTGTCGG GTTCTGACTT CGTTGAGATG TTCGTTGGTG TGGGTGCGGC ACGGGTGCGC
GACCTGTTTA AGCAGGCAAA AGAGAAAGCA CCTTGTATCA TCTTTATCGA TGAGATTGAT
GCGGTAGGGC GTTCACGTGG TCGTGGTTCT ATGCCCGGTG CAAATGATGA GCGGGAAAAC
ACCCTCAACT CATTACTCGT GGAAATGGAT GGCTTTGCCA CCGACTCGGG TATCATTATT
TTGGCTGCAA CCAACCGCCC TGATGTACTG GACTCCGCCT TGCAGCGTCC AGGCCGTTTT
GACCGTCAGA TCAGCATCGA CAAGCCGGAT ATTATTGGCC GCGAAGCTAT CTTCCGGGTC
CATTTAAAGC CGATCAAACT GGCTGCTGAT GTTGATCCTA AAGAGCTGGC AGCTCAAACC
CCCGGTTTTG CGGGTGCAGA AATTGCCAAC GTTTGTAACG AGGCTGCTCT TATTGCTGCT
CGTAGTGATA AAGAAGCTGT TGATATGAAA GACTTCCAGG ATGCGATGGA TCGTGTGATT
GGTGGTCTGG AAAAGAAGAA CAAGATCATA TCTCCGGAGG AAAAAGAGAT CGTAGCCTAT
CACGAAGCTG GTCACGCAGT GGCAGGCTGG TACCTTGAAC ATGCCGACCC CCTCGTAAAA
GTGACGATTG TACCGCGTGG TGTAGCTGCG CTGGGATATG CTCAGTATTT ACCTCGCGAA
CAGTACCTGT ACCGTACTGA GCAGCTTATG GACGAGATGT GTATGGCGTT AGGTGGCCGT
GCTTCCGAAG ATCTGATCTT TGGTAAAGTA TCTACCGGTG CGCTAAGCGA TTTGGAGCGA
ATTACCAAAC TCGCTTATAG CATGGTGACG ATGTATGGCA TGAACGATAA AATTGGTAAC
GTATCTTTCT ACGATTCCAA ACAGTCGGAT TATAACTTCA ACAAGCCTTA CTCGGAAGAA
ACGGCCAAGC ACATTGATGA TGAAGTTCGT AAAATCGTTA GCCTAGCTTA TGAGCGTACG
AAAAATCTGT TGACTGAGCA CCGCGATGCC CTGGAAATTC TGGCTAAAGA GTTACTTGAA
AAAGAGATTC TTTATCAAAA TGACTTGGTT CGTCTGATCG GTAAGCGTCC ATTCGAGCGT
GAAACGGTTT ATCAGGCTTA CAAGAACAAA GGAGTAGCTG AAGAAGTGAA GGAAGAGATC
GGCAAGGAGT CGAAACCTGC CGAAACGGAA CCAGAATCGC TTCCTATTTG A
 
Protein sequence
MAENNNRNPL VPRGGPRKPN FQGWIVALLI AAILGITFFN RSSATQEISQ KQFERMVKDH 
EVAEAIVVND KIAEVTLTQQ AAQSPRYKNR FADKPYFGNN QGPHFQFQIA SGETFKKDLD
QLQQGQPDNE KIDLKFETRS DFGSIISTWG FLIVMILAMY FLLGRMSGAG GPGGQIFNIG
KSKAALFDAD NKVKITFNDV AGLDEAKEEI KEIVDYLKNP TKFTKLGAKI PKGALLIGPP
GTGKTLLAKA VAGEAGVPFF SLSGSDFVEM FVGVGAARVR DLFKQAKEKA PCIIFIDEID
AVGRSRGRGS MPGANDEREN TLNSLLVEMD GFATDSGIII LAATNRPDVL DSALQRPGRF
DRQISIDKPD IIGREAIFRV HLKPIKLAAD VDPKELAAQT PGFAGAEIAN VCNEAALIAA
RSDKEAVDMK DFQDAMDRVI GGLEKKNKII SPEEKEIVAY HEAGHAVAGW YLEHADPLVK
VTIVPRGVAA LGYAQYLPRE QYLYRTEQLM DEMCMALGGR ASEDLIFGKV STGALSDLER
ITKLAYSMVT MYGMNDKIGN VSFYDSKQSD YNFNKPYSEE TAKHIDDEVR KIVSLAYERT
KNLLTEHRDA LEILAKELLE KEILYQNDLV RLIGKRPFER ETVYQAYKNK GVAEEVKEEI
GKESKPAETE PESLPI
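
As a rough consistency check between the two sequences, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal illustration, not the tool used to produce this record. The names `BASES`, `AMINO_ACIDS`, `CODON_TABLE`, and `translate` are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons), and it only translates the first and last stretches of the gene, copied from the listing above.

```python
# Codon table in the usual T, C, A, G ordering. Translation table 11 shares
# these amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # The listing groups bases in blocks of ten, so drop all whitespace first.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence, as listed above.
first_bases = "ATGGCAGAAA ATAATAATAG AAATCCGTTA"
last_bases = "CCAGAATCGC TTCCTATTTG A"

print(translate(first_bases))  # MAENNNRNPL
print(translate(last_bases))   # PESLPI
```

The two outputs match the start (MAENNNRNPL) and end (PESLPI) of the protein sequence in this record. Passing the full gene sequence to the same function would extend the check to the whole protein.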