Gene Slin_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0231 
Symbol 
ID8723959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp309231 
End bp310736 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF404 
Protein accessionYP_003385095 
Protein GI284035165 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0467865 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC AAAACGCCCG TCAATCACAA TCGCAAACAC TCAATGGTAT GACTCAGTCG 
CAGGGTCAGG CCAATGCTAA TTTTTCGTTC AGCGATTACC AGACTGAAAA CTTTTTTGAT
GAGATGTTCG CCAGTGAGAT GCAGGTGAGA GCAGGCTACG CTCCTTTCCA GCAGCGCGTT
GAGCAGCTTA CCCGCGAGGA TCTTATTGGG CGACAGCATG CAGCCGAACG GGCACTCATG
AGCATGGGCA TCACCTTCAA CGTTTACTCG GAAGGTGAAG GCACCGAGCG GATTATGCCC
ATCGACATTA TCCCCCGTAT TATCGAATCG GCCGAGTGGG ACCGGCTCGA AGCGGGCCTC
ATCCAGCGTA TTAAAGCCAT TAATATGTTT CTGGACGACG TCTACAACGA TCAGAATATT
CTGAACGACG GCGTTGTTCC CCGCGACCTT ATCGAATCCA GCAAGTCGTT TTTGCCGGGC
TGCTTAGGTG TAAAACCGCC CAAAGGCATC TGGTGCCACA TTACCGGCAC CGACCTGATC
CGGGGCGAAG ACGGTACCAT GATGGTGCTT GAAGATAACC TTCGTTGCCC ATCGGGGGTA
TCGTACATGC TCGAAAATCG CGAACTCAAT AAGCAAACCT TCCCCGATGT GCTGGCCCAG
ACGGGCGTTC GGCCGGTTTC GGATTACCCA ACGCGACTGT TGCAGATGTT GCAGTACATT
GCCGACCGGC CCAACCCAAC CGTAGTAGTC CTAACGCCGG GTATCTATAA CTCCGCTTAT
TTCGAGCATT CGTATCTGGC TCAGCAGATG GGCGTCGAAC TGGTCGAAGC GCGTGATCTA
GTTGTATCGG GTGGTTACGT AAAAATGCGC ACGACCAAAG GCTTTCAGAT CGTCGACGTG
ATCTACCGCC GTATTGATGA TACATTCCTG GACCCCAAAG CCTTCAATCC CGATTCGATG
ATTGGCGTAC CGGGCATTTT CGAGGTGTAC AAAAAAGGTC GTGTTGCGCT GGCCAACGCC
CCCGGAACCG GTGTTGCCGA TGATAAAGTG ATTTACGCTT ACGTACCCCG CATCATTAAA
TATTACATGG GCGAAGAAGC TATCATTCCC AACGTAAAAA CGTATATCTG CCGCGAAGAG
GAGGACTGCG CTTACGTCAT GGAAAATATT GAAAAACTGG TGGTTAAGGA AGCCAATGAA
GCGGGCGGTT ATGGTATGCT CATCGGCCCG AAGGCGACGC CGGAAGAACA CGAATTATTC
CGTCAGAAGA TCAAGGACAA TCCCCGGAAT TACATCGCCC AGCCAACCAT TTCGCTGTCA
CGCGTGCCCT GCATTGTGGG CGACCATGCC GAAGGCCGAC ACGTTGACCT TCGGCCGTAT
ATTCTCTACG GCGACGGCGT CAACGTCATT CCCGGCGGCC TTACCCGCGT AGCCCTGCGC
AAAGGCTCCC TCGTGGTCAA CTCCTCACAG GGCGGTGGCG GTAAAGACAC ATGGGTGTTG
TATTAG
 
Protein sequence
MKKQNARQSQ SQTLNGMTQS QGQANANFSF SDYQTENFFD EMFASEMQVR AGYAPFQQRV 
EQLTREDLIG RQHAAERALM SMGITFNVYS EGEGTERIMP IDIIPRIIES AEWDRLEAGL
IQRIKAINMF LDDVYNDQNI LNDGVVPRDL IESSKSFLPG CLGVKPPKGI WCHITGTDLI
RGEDGTMMVL EDNLRCPSGV SYMLENRELN KQTFPDVLAQ TGVRPVSDYP TRLLQMLQYI
ADRPNPTVVV LTPGIYNSAY FEHSYLAQQM GVELVEARDL VVSGGYVKMR TTKGFQIVDV
IYRRIDDTFL DPKAFNPDSM IGVPGIFEVY KKGRVALANA PGTGVADDKV IYAYVPRIIK
YYMGEEAIIP NVKTYICREE EDCAYVMENI EKLVVKEANE AGGYGMLIGP KATPEEHELF
RQKIKDNPRN YIAQPTISLS RVPCIVGDHA EGRHVDLRPY ILYGDGVNVI PGGLTRVALR
KGSLVVNSSQ GGGGKDTWVL Y