Gene Slin_2589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2589 
Symbol 
ID8726334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3129092 
End bp3131077 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content49% 
IMG OID 
Productprotein of unknown function DUF303 acetylesterase putative 
Protein accessionYP_003387406 
Protein GI284037476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.541867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.205494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGAC TTTGCATTGG AGTTATATTG TTTTTGTACT ATTTCCAAGC TCAGGGACAA 
CTAGTTTTTG AGCGACTACC GCGTGAATTA CAGTTATATC CACGCGATGC CAATAACCAG
GCCGAAGTTG TTGTAAGTGG TAAAATGGAT ACGCCCGGTT ATTCAAAAAT AACGATGCGA
ATGGCGCGGG AGGGCGTATT GACGAAAGTG GTAAGCCAGA GCCTGGAGCC ATCGAGCAGT
AATGCCCCGT TTAGTTTATC GACAACAATT AAAGCCGAAC CAGCCGAATA CTCCTTTCAG
GTATATCTCT TTAAAGGACA GGATTCGTTG CTGGTCGCTA ATTCCCAGCG AATTGTCTGT
GGCGATGTGT ATATCATACA TGGTCAGTCT AACGCACTGG CGCTGAGTGA TTTCGATGGG
TTGTACTCGT TTAATTTCAA TGACAGGTAC ATGCGAAATG TTGCTTACCC TTATCTTGGA
TTACCCTCTC AGATGAGTTG GTATCCGGCC AAACAGCCCT TTGCCAGCGT TGGTGGGTTA
GGCTTGACAT TGCAGCGGCT CATTCTTGAA AATTATGGTA TTCCTACCTG TGTCATCAAT
GGAGCCATGG GCGGAACACC CATCAGTGCC TTATCTGTTC GGGACCCGCT CAATCACGCC
AATCCGATTA CGTTCTATGG CGATCTGCTG AATCGGGCGC AATGGGCGGG GGTTGCCAAA
CAAACGAAAG CGATCATCTG GAAGCAAGGC GAAGAAGATG CGGGAAGTGG CCTGCCCGGC
TACCCGGCAA AGTTTGCCAC GCTCTATAAT CAATTCAGAG AAGATTACGG CAACGCCCGC
ATTTACGTAG GGCAAATCAA TATTTTAAAC AACCCACAGG ATAGTGCTGC TGCCCTGCGC
GACTTTCAGC GACGAACAAA ATACATCTTC AACAATGTAG AAACCATCGC TACCGTTGGA
ACACCGGGTT ATGATGGGGT TCATTACAGT GGAATTGCGC ACCAGCGAAT GGCTTTCGAG
CAGTTCAGAC TCATTGCCCG CGATATATAC GGGTCGAAGG ATACGCTCCA GATCAACTCG
CCGGATGTAA GAAAAGTCTT TTACAACAGC CGCAAAGATT CCATCACACT GGTTTTCGAC
GATCAGATGC AGATGGTCTG GAAAAACGAC ACCACCTTCT ACAATTTTGC AACGGGCGCT
AAAATTGCCT TCCGGGAGCA GAAGGATTTC TTTTATCTGG ACCGGCAGTC GGGCCTGGTA
ACGGGTGGTT CGGCCAACGG AAACCGGGTT GTTTTAAGTT TGAAACAACC GGCTTCGGCC
AAAACGATCC GCTATCTGCC TGCCTATTTT TCGGATGCCG CTTCGCCGTT CTACGACGGG
CCTACGCTTC GGAATACACG CGGAATGCGG GCCTTTTCGT TTGACAACGT CGCTATTGCC
GATGCGATCC CCGCTGTAAC CACGCTGGTA GCCAAGCCCA TTTCCGAAAA ACAGATACAA
TTAAGCTGGA CCGTCTTGCC AACGACCCAA AACCAGATTC TGGAACGGGC CACTGGCACC
CCCGCAAATT TTACACCGAT AGCTACCCTC GGCGGAACCG TCGGCGCGTA TAACGATACG
AATATTCCGG ATATCTTTGG CACCTATTAC TACCGTTTAC GGGCGTTCAG TGCTGTTTCC
GAATCTGCCT ACAGTAACGT CGCTAGTGCC CGCCCGCTGG TGCTGGGTAT CGAGCCTGGT
GAACCGCTTG TTAAGATTTA TCCAAATCCT GTGGCATCGG ACCGGATGTT GAATGTAGAA
GCAGATCAGG CTTTTTTTAC TGAATTAACC GTGCGTGATC TTCTGGGGAG AGCCGTAAAA
ACATGGCGTG GTACGCCTAA AAAGGCGATT TCGCTGGCGC TCGATAATCT GGAAGCGGGT
CTTTATATCA CCGACATTCA AACCGTCAGC GGGCATCTAA TTCGCCAAAA ACTGATTATA
CGTTGA
 
Protein sequence
MNRLCIGVIL FLYYFQAQGQ LVFERLPREL QLYPRDANNQ AEVVVSGKMD TPGYSKITMR 
MAREGVLTKV VSQSLEPSSS NAPFSLSTTI KAEPAEYSFQ VYLFKGQDSL LVANSQRIVC
GDVYIIHGQS NALALSDFDG LYSFNFNDRY MRNVAYPYLG LPSQMSWYPA KQPFASVGGL
GLTLQRLILE NYGIPTCVIN GAMGGTPISA LSVRDPLNHA NPITFYGDLL NRAQWAGVAK
QTKAIIWKQG EEDAGSGLPG YPAKFATLYN QFREDYGNAR IYVGQINILN NPQDSAAALR
DFQRRTKYIF NNVETIATVG TPGYDGVHYS GIAHQRMAFE QFRLIARDIY GSKDTLQINS
PDVRKVFYNS RKDSITLVFD DQMQMVWKND TTFYNFATGA KIAFREQKDF FYLDRQSGLV
TGGSANGNRV VLSLKQPASA KTIRYLPAYF SDAASPFYDG PTLRNTRGMR AFSFDNVAIA
DAIPAVTTLV AKPISEKQIQ LSWTVLPTTQ NQILERATGT PANFTPIATL GGTVGAYNDT
NIPDIFGTYY YRLRAFSAVS ESAYSNVASA RPLVLGIEPG EPLVKIYPNP VASDRMLNVE
ADQAFFTELT VRDLLGRAVK TWRGTPKKAI SLALDNLEAG LYITDIQTVS GHLIRQKLII
R