Gene Slin_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4047 
Symbol 
ID8727805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4864733 
End bp4865845 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content32% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003388834 
Protein GI284038904 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.627457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTAC CAAACAGTAC ACATATAGAT ATTTCCAAAC TTACAGGTAT TTTTACTAAT 
ACGACTAACT CCTACAAATT CTACTGGTTT TTAGCAATTT TAGATAGTAT AAAAGATAAT
GACCAAAAGA TAATCTCATT AAATGATATG GCATTGCGGA TGGTTGCAAA CGTCTGGTAT
CCGCTAGATT ATTTTAAATT ATCATTTGGT ACCGATGACA GTTTTATAAA AATTTCTCAA
TTAGTTTCAA ACCGTATTAT TATTGACAAT AGTACAAACT CTGCCCCTCT ATTTGACCAA
CTACAGTCAA AACTATCAAA TAATGACTTA ACAGAAGTTT ATAGAACAGT TAATAAGATA
GTACGTTATG TTCCTTACCG ATTCGTAAGA CCTTTCATAA ATGAACTTCT TACTGTTTCA
AAGGACAGCC AGGTAAATTT AAATATAAAG GAAATTTGTA ACAGTATATT TCATACATCA
CCTAATAAGG TTATGTATCG TTTTATAGAT AACTCTATTG AGTTAAGCGA TGTATGGAAA
GATTATTTAA AAACGCATCA AGGAATATTA CGTGGGTTTA TCTATTGGCA TCTTTTGCAA
TTTTTACAGA AAAACAATCC AAATGTCATT GGCTTACCTG ATAAACTTTT CAAACGAAGT
ACACGCGAGC CAAAACGAGG AAATGAGTTT TGGAAGCCCT ATATAGAGAT TAACCCAGAT
TTAAAATGTG TCTATTCTAG ACAACGAATT AATTCTCAAA ACATGAGTTT AGACCATTTT
TTACCTTGGT CGTATGTAGT GCATGATTTA CTCTGGAATA TAATCCCTAC CACTAAGAGT
GTTAATTCGG CAAAAAGCAA TTCATTACCA TCAATAGATC TATACTTCGA AGATTTCGCT
CAGTTACAAT ATGATGGCTT TCATTTTCAT GTTCTAAAAG GAAATGAAAA ATTACTTGAA
GACTATTCTG TGCTTTTCGG GCAGAATATA CACTCTATTC AGGAACAGCC TTTTGCCTTG
TTTCGTAAAC GGCTAGAGCA AACTATTTTA CCCCAGTTAC AAATGGCTCA AAACTTAGGC
TTTATTTATC CATTTATATT TTCTAATTTT TGA
 
Protein sequence
MELPNSTHID ISKLTGIFTN TTNSYKFYWF LAILDSIKDN DQKIISLNDM ALRMVANVWY 
PLDYFKLSFG TDDSFIKISQ LVSNRIIIDN STNSAPLFDQ LQSKLSNNDL TEVYRTVNKI
VRYVPYRFVR PFINELLTVS KDSQVNLNIK EICNSIFHTS PNKVMYRFID NSIELSDVWK
DYLKTHQGIL RGFIYWHLLQ FLQKNNPNVI GLPDKLFKRS TREPKRGNEF WKPYIEINPD
LKCVYSRQRI NSQNMSLDHF LPWSYVVHDL LWNIIPTTKS VNSAKSNSLP SIDLYFEDFA
QLQYDGFHFH VLKGNEKLLE DYSVLFGQNI HSIQEQPFAL FRKRLEQTIL PQLQMAQNLG
FIYPFIFSNF