Gene Slin_4453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4453 
Symbol 
ID8728213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5394371 
End bp5395600 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content50% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003389233 
Protein GI284039303 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACACA CAAGATTATT AATAGTCGGT TTTGTTTGGT TAGGTTGTCT GGCTGTTCAA 
GCGCAAATAA AACCCCAGAC GGGTTTTTGG CGTGGCGTCT TTACTATGGC CGGCGGGCAT
ACGGCTCCGT TTAACCTTGA GCTGACGGGC AAAACGGCTT ACCTGCTCAA TGGTACCGAA
CGCTTTGAAC TGAAAAACGT GACGCAGCGG GGCGATTCGC TGATTATTCC GGTCGATGTG
TACAATACGG TACTGGCGGC AAAGGTTGAG GATGCCAAAA CGTTGTCGGG CGTATTTAAA
CACCTGGAAT CTCCCACAAC GGGTGTCCCT TTCCGGATGG AGCACGGAAA ACGGTATCGG
TTCGTTGAAA ATCAGGCTGC TCCTGTGGTG AGCATGCATG GCAAATGGGA CATCCTTATC
GACGAAAAAA TCAAACTCAT CGGTGTATTC GAACAACACG GCAGCAAATT GACCGGTACT
TTTCTGAGTA CGGGTGGAGA CATGCGTTAC TACGAAGGCT CAGTGCAGAA CGATGAATTT
GCCTTGTCTG CGTTTGACGG CTCCAACCCG CAGCTGTTTA TCGGTAAGAT CAGTGGTAAC
GAATTAAGTG GCAGCTTCGT CAATAGCCGA CAGGTACGTT CATTGAAAGG CACCCGGAAT
GCACAGGCAG CTTTGCCCGA CGCTTACAGC CTGACAAAAA TGAGAGAAGG GATTCCTTTC
ACGTTTACCT TTCCCGATGG GTTTACGGGC AAACTCGTAT CACTAAGCGA CCCTAAGTAC
AAGAATAAAG TGGTCATCGT GACCACGATG GGAAGCTGGT GCCATAACTG CATGGACGAA
GCGGCTTTTC TAGCGCCCTG GTACAAGGCT AACAAAGATC GGGGTGTCGA AATAATTGGT
CTGGCTTTTG AAGTGAAAAA CGATCCGGTT TTCGCCAAAG CCCGTCTCGA AACGGTTAAA
AAACGGTACC AGATTGGCTA TGATATGCTC TTCGCGGGTA TTGCCGACGA AAAACACGCG
TCAGCCGTAT TACCCGCCCT GAGCGAGATG TCAGTGTACC CTACCACGAT TTATGTAAGA
CGTAATGGCG AAGTGGCCAA AGTGCATACC GGCTACTCTG GGCCAGCCAC CGGACAGTAT
TACGAAGCGT TTATCAAGGA GTTCAATGCC GAGATGGACC AGTTGCTCAA TGAGCCGATT
TCAGACAGGG CACCGGGTAA GGCTAACTAA
 
Protein sequence
MRHTRLLIVG FVWLGCLAVQ AQIKPQTGFW RGVFTMAGGH TAPFNLELTG KTAYLLNGTE 
RFELKNVTQR GDSLIIPVDV YNTVLAAKVE DAKTLSGVFK HLESPTTGVP FRMEHGKRYR
FVENQAAPVV SMHGKWDILI DEKIKLIGVF EQHGSKLTGT FLSTGGDMRY YEGSVQNDEF
ALSAFDGSNP QLFIGKISGN ELSGSFVNSR QVRSLKGTRN AQAALPDAYS LTKMREGIPF
TFTFPDGFTG KLVSLSDPKY KNKVVIVTTM GSWCHNCMDE AAFLAPWYKA NKDRGVEIIG
LAFEVKNDPV FAKARLETVK KRYQIGYDML FAGIADEKHA SAVLPALSEM SVYPTTIYVR
RNGEVAKVHT GYSGPATGQY YEAFIKEFNA EMDQLLNEPI SDRAPGKAN