Gene Slin_5203 details


Gene Information

Locus tag: Slin_5203
Symbol:
ID: 8728969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6350188
End bp: 6352194
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: OmpA/MotB domain protein
Protein accession: YP_003389974
Protein GI: 284040044
COG category:
COG ID:
TIGRFAM ID:
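
The coordinate and length fields above are mutually consistent, and a short check makes the relationship explicit. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information record above.
START_BP = 6350188
END_BP = 6352194
GENE_LENGTH_BP = 2007
PROTEIN_LENGTH_AA = 668


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # Both values should agree with the Gene Length and Protein Length fields.
    print(span_length(START_BP, END_BP))
    print(expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions, the span covers 2007 bp, which is 669 codons, or 668 residues plus one stop codon.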


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.543853
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA ATACGTTATC TGTACTGCTC GTCGTGTTCG GTCTGGCCAG CGGACTTACG 
GGCTGCAATT CGGCCATGCA GGCGTATAAA AAAGGAGTTC GTCACTATGA CGCCGGTGAA
TATAATCTGG CACTCACCCA GTTCCAGAAG GCAGCCAAAG GCTCCATCGA CCCCGCCCGC
CTGAATTATT ATACCGCCGA ATCGTACCGA CTGTCGAACC GGTTTGGCGA GGCTGTTCCG
TTCTACCAGA AAGCCATCGA AGCCAACACG ACGGAGCCGG ATGCCCGCTT CAACTATGCC
TATGCTCTGA AATCGCAGGG GAATTATACC GGGGCGCTGG AGCAATTGCA GCAGTATGTG
GCCAATGCAC CGAAAACGAC CGCTAAAGCC ACGCTCGATA AAGCCCGTCG CGAAGTGGAG
ACGTTGAAAG CTATCAACAT CATTGCCCAG AACAAGTCGC TCATTACGCT TCGTAACATG
AGCAATCTCA ATTCGCCGGG GGCCGAGTTT GCGCCCGTGG TGCGGGGAGA GGAGCTGGTC
TTTACTGCCT CGCGGAAAGA AACAGTTTAC AAGAACAACG GCCAGCCCAT GCTGGGGCTG
TACAAAACCA AATTGAACCA GAAACCCGAC GAAACGGGTA GCACCGGTGG CGCATCGGGC
CAGCCGGAAC CGTTCAGTAC AAACGTGTTT CAGGGTGATG TGAACGAAGG AACGCCCGCG
TTCTCGAAAG ATGGCAAAAC CATGATTCTG GCGCGGGGCA ACAATGGCAA GCGTAAAGGT
GGATTGGATG TTGATCTCTA TATCAGCCGC TTGGGCGACA ATAATACCTG GAGCCAGCCA
CTTCGTCTGC CCATCAGCGA CTCGCTGGCG TGGGATGGAT CGCCCGCGTT TTCGGCCGAT
GGCAAAACAT TGTATTTCGC ATCGAACCGG GCCGGTGGTG CCGGTGGTAT CGACTTATAC
CGGACCAGCA TCGACGCGTC GGGCCGTTTC AGCCGACCCG TTAACATGGG CCGCGACATC
AATACACCGG GCGACGAGAT GTTCCCGTAT GTGGGGGCCA ACGCCAAACT GTACTTTGCC
TCCGATGGGC ATCCCGGCTT AGGTAAACTC GATATTTTCG TGGCGACCCG TTCGGGCGGG
GTGACACGGG TGGAAAATAT GGGCCAGCCC ATCAACTCGC CCGCCGATGA TTTCGGACTG
ATTTACACCG ACCCAACCAA AGGCTTCATG GCTTCGAACC GGGGCGGGGG TAAAGGCGAT
GACGACATCT ATTTCTTCCA GGAAGGGCCA TCCGTCGATT CGACAACGAT TGTTCAGACG
CCACCGGCGA ACGCGCCTAA AATTGTGCGT TACTTCATTG CCGGAACGGT TTCGGCTAAC
GAAACACCCA TTGTTCCGCT TGATTCGGCA CGGGTTCGCA TTTTGGACGA TGCCACGGGG
CAACCCATTG CCGAAGCTAC AACCGGGCAG CCGGGAACAT TTGGTAAGTA CCCGTTGCAG
GAAGGTAAGG ACTACACCAT TCTGGCCGAG CGCCGGGGTT ACCTGACCCG CCGGGAGCAG
TTCACGATGC AGGGCAAGAG TATCCCGGCG ATCTTCCTGA CGAAAGCGCA GACCGATACT
ACCTTCAACG TGGCCCTGCT GCTCGACCGG GCGACGCTTA ACAAGACGTT TGTACTGGAG
AATATCTATT ACGATCTCGA TAAGTTCAAC ATCCGCCCTG ATGCCGCCGA AGAACTCGAT
AAGCTGGTGA CAATTCTAAA AGACAACCCA ACGCTGAAAA TCGAGTTGAG TTCACATACC
GACGTTCGTG CGCCGGATGC CTACAACATG CGCCTGTCGC AGAACCGGGC TAAGTCGGCC
GTTGATTACA TTGTCTCGAA AGGTGTTGAC GCCAGCCGGA TGATCGCTAA AGGGTACGGC
GAAACGCAGT TGATCGTTAA GAACGCCAAA ACGGAAGAAG AACACCAGCG TAACCGCCGA
ACAGAGTTCA AGATTCTGGA ACTTTAG
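
The gene sequence is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation. The sketch below shows one way to do that. The helper names are hypothetical, and the codon table is deliberately partial: it holds only the nine codons in the last line of the sequence, which are the same in the standard code and in translation table 11. Applying `gc_percent` to the full cleaned sequence would give a value to compare with the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def codons(seq: str) -> list[str]:
    """Split a cleaned sequence into complete codons, dropping any remainder."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# Partial table (illustration only): just the codons in the final line.
PARTIAL_CODON_TABLE = {
    "ACA": "T", "GAG": "E", "TTC": "F", "AAG": "K", "ATT": "I",
    "CTG": "L", "GAA": "E", "CTT": "L", "TAG": "*",
}

# Last line of the gene sequence above, copied verbatim.
TAIL = "ACAGAGTTCA AGATTCTGGA ACTTTAG"


def translate_partial(seq: str) -> str:
    """Translate using the partial table; raises KeyError on other codons."""
    return "".join(PARTIAL_CODON_TABLE[c] for c in codons(seq))


if __name__ == "__main__":
    tail = clean_sequence(TAIL)
    # The result should match the end of the protein sequence plus a stop.
    print(translate_partial(tail))
```

The preceding lines of the gene sequence total a multiple of three bases (33 lines of 60), so the final line starts in frame and can be translated on its own.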
 
Protein sequence
MKINTLSVLL VVFGLASGLT GCNSAMQAYK KGVRHYDAGE YNLALTQFQK AAKGSIDPAR 
LNYYTAESYR LSNRFGEAVP FYQKAIEANT TEPDARFNYA YALKSQGNYT GALEQLQQYV
ANAPKTTAKA TLDKARREVE TLKAINIIAQ NKSLITLRNM SNLNSPGAEF APVVRGEELV
FTASRKETVY KNNGQPMLGL YKTKLNQKPD ETGSTGGASG QPEPFSTNVF QGDVNEGTPA
FSKDGKTMIL ARGNNGKRKG GLDVDLYISR LGDNNTWSQP LRLPISDSLA WDGSPAFSAD
GKTLYFASNR AGGAGGIDLY RTSIDASGRF SRPVNMGRDI NTPGDEMFPY VGANAKLYFA
SDGHPGLGKL DIFVATRSGG VTRVENMGQP INSPADDFGL IYTDPTKGFM ASNRGGGKGD
DDIYFFQEGP SVDSTTIVQT PPANAPKIVR YFIAGTVSAN ETPIVPLDSA RVRILDDATG
QPIAEATTGQ PGTFGKYPLQ EGKDYTILAE RRGYLTRREQ FTMQGKSIPA IFLTKAQTDT
TFNVALLLDR ATLNKTFVLE NIYYDLDKFN IRPDAAEELD KLVTILKDNP TLKIELSSHT
DVRAPDAYNM RLSQNRAKSA VDYIVSKGVD ASRMIAKGYG ETQLIVKNAK TEEEHQRNRR
TEFKILEL