Gene Slin_5890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5890 
Symbol 
ID8729668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7132157 
End bp7133518 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content53% 
IMG OID 
ProductFmu (Sun) domain protein 
Protein accessionYP_003390652 
Protein GI284040722 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.133324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00389453 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGACGC AGTTGGGAGC AGAGTTTGCC AAATTTGAGT CCGCACTGGT GGAGCCAACC 
CCGGTGAGTA TCCGCATTAA TGCCCGAAAA TTGGGTGGGG CCGCTTATGA TCCCACCGAT
CTGGTGCCTG TTCCCTGGTG TCCCGATGGC TATTACCTGC CCGAACGCCC CAGTTTCACG
CTTGATCCGT TGTTTCAGGC TGGTGCTTAT TATGTGCAGG AGGCTTCGTC GATGTTGTTG
CACGAAGCCC TCCGGCAAAC GGTCAATCTC GACCGGCCGC TAAGGGTACT AGACCTATGC
GCGGCTCCGG GTGGGAAAAG TACGTTGCTG GCGTCGGCCC TGCACCCCGA TAGCCTATTG
GTATGTAATG AAGTGATACG TAGCCGGGTG TCGGTCCTGC GCGAGAATCT GGATAAATGG
GGTTACCCAA ATGTGGTGGT CAGCAACCAC GACCCGGAAG ACATGAGCAA GCTGACGGGT
TTTTTCGATG TTGTGCTGGT CGATGCACCC TGCTCGGGCG AAGGCCTGTT TCGAAAAGAT
CCCGACGCTA TGCAGGAGTG GTCGGAAGCG AGTGTTGATC TGTGCTCAGC CCGGCAGAAA
CGGATTCTGG CCGCAGCTGC ACCTTTACTC GATAAAGACG GTATTCTGAT CTATAGCACC
TGTACATATA ATGATAGAGA GAACGCCGAA AACGTTCGAT ATCTGACCGA AATCGGGTTT
CGTAATAAGC CGCTTATTCT GCCATCGGAA TGGAATATTG TGGAGCGACA GGCGGGCGAT
CCGGAAACGG GTGAGGCCGT CGGGTATCAA TGCTACCCGC AGCGGGTTCG GGGCGAAGGC
TTTTTTATCA GTGCCTTTAA AAAAACGGGC TTTACGGCTC CGGTAAAACT CGATGCCCGA
ACGTTTCGGA CCATTCGTGC CCTTCGACCC CGCGAAACGG CTTCAGCGGC CAAGTGGCTT
CAGAATCCAG CCGATTTTTC GTTCTGGGAG AAACCCAATG GCGATGTGAT GGCCCTGCCT
AAAGCACTCG AAAAAACGTA CCTATTTCTC GACAGTGCTT TAAAGAGTAA AGGCTTTGGG
TTAGAGATGG GGCAGTTTAA AGGAACGGAC TTTGTACCCT CGCACGCGCT GGCGCTGAGT
ACGGCGGTTA ACCAAGACCT GCCGGGGCTC GAATTGAGTA AGGAAGACGC CCTGCGCTAC
TTTAAGAAAG AGAATCTAGT ATTTGATGAA CCCGTAAAAG GCTGGCTACT CGCCAAATAT
AAAGGGGTAA ATCTGGGTTG GGTAAAAGGA GTAGGTACTC GCGTTAATAA CTATCTTCCG
AAAGACTGGC GAATCAGAAT GGATATAAAG GAGTACGTAT GA
 
Protein sequence
MQTQLGAEFA KFESALVEPT PVSIRINARK LGGAAYDPTD LVPVPWCPDG YYLPERPSFT 
LDPLFQAGAY YVQEASSMLL HEALRQTVNL DRPLRVLDLC AAPGGKSTLL ASALHPDSLL
VCNEVIRSRV SVLRENLDKW GYPNVVVSNH DPEDMSKLTG FFDVVLVDAP CSGEGLFRKD
PDAMQEWSEA SVDLCSARQK RILAAAAPLL DKDGILIYST CTYNDRENAE NVRYLTEIGF
RNKPLILPSE WNIVERQAGD PETGEAVGYQ CYPQRVRGEG FFISAFKKTG FTAPVKLDAR
TFRTIRALRP RETASAAKWL QNPADFSFWE KPNGDVMALP KALEKTYLFL DSALKSKGFG
LEMGQFKGTD FVPSHALALS TAVNQDLPGL ELSKEDALRY FKKENLVFDE PVKGWLLAKY
KGVNLGWVKG VGTRVNNYLP KDWRIRMDIK EYV