Gene Slin_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3997 
Symbol 
ID8727755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4808847 
End bp4810307 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content54% 
IMG OID 
ProductOuter membrane protein-like protein 
Protein accessionYP_003388786 
Protein GI284038856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000837564 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGACGA TTATATTCGG TTTAAGTCTG TTGATTGGTA GTCAGTTAAT TTCGGCCGCT 
CAAGCACAGC CAGCGGGAAC ATCCTCTGAC ACAACCCTGT TTACAGCCGC CGATTTTTAC
CGGGTCGTTT TTAGCCATCA TCCCCTTGTT CGACAGGCGG CTCTGCTGAA CAGCGAAGCC
CAGCAGGAAA TTATGCAGGC CAGGGGTGCT TTTGACCCCA AGCTATTCTC AACCTACGAC
CGGAAAGAGT TTGGGCAGGA TTTGTATTAT AACAAATGGC AGTCCGGCTT GACAATACCC
GTTCTGCCAG CCGGTATCGA CGTAAAGCTG ACCTATGACC GGAATATGGG GCAGTATGTC
AACCCCGAAG AGCGGGTACC GTCGTCGGGT CTGGCGGCCG TGGGAGTACG CGTTCCCGTT
GGTCAGGGGC TGATTATCGA TGCCCGACGC AACGCCCTGC GCCAGGCTAA ACTTGCCGTT
ACCCTGGCCG ATGCCGAACG CCTGAAACTC ATCAACAAAA CCTTGTTCGA TGCGGCCAAA
ACCTATTGGG AATGGTATAT GGCTCACCAG CAGTATCGGC TGATCCAAAA CGGCTATCAG
GTAGCCGACA CCCGCTTCCG GGCCATCCAA CAGCGGTCAT TGCTGGGCGA TGCGGCCGCC
ATCGATACTA CTGAAGCGCT CATTACGGCC CAGGACCGCC TGGTGCAACT CCAGCAAGCG
GAAGTCAATC TGCAAAATGC CCGATTGCGG TTGAGTGTTT TCTTATGGAA CAGTACTGAT
AGCGATGGCA TGCCCCAACC GGTTGAACTT CTGCCTACAG TGGCCCCTCA GCCGGTGCCC
GCCGACCGGC TCGATGAAAG TACACTACAG GCCTTGCTGA GTAAAGCCGC CGAACGGCAC
CCGGAGTTGC TGAAACTAAC CACCAAAGGA CAGCAGCTGG CACTGGAAGA GCGATTTCAG
CGGTCTTTAC TGCAACCGCA ACTGGTGTTA AACGCTAATC TGCTTAGCCG GACACCCGCA
GCCGGTGTTG GCTACGACTG GGCGAGTTAT TACGCCTTTC GAGCCGATAA TCACAAAATC
GGGGTGGACT TGACCTTTCC TATCTTTCTG CGAAAAGAGC GGGGCAAGCT TCGGCAAATA
CAGATCAAAA ACCAGCAGGT TACACTAGAA CGCCAGCAAA CCGGCCGTGC CATCAGCAAC
GATGTTCAGG CAGCCTGGAA CGAGCTAAAG GCGCTGGAGC GCCAGATCGA TGTACAGCAG
CAGACGGTCA GGAACCAACG GATTTTACTT GGGGCAGAAC AGCAGAAATT CGACATCGGT
GAGAGTTCAC TGTTTTTAGT CAACAGCCGC GAATCGAAAC TGATCGATTT GGAAATCAAG
CTGGAAGAGC TACGTACCAA ACAGCAGAAA GCGGTAGCCG CCCTGTGGTA CGCAGCCGGA
ACAAATCCAG AAGCGCAGTA A
 
Protein sequence
MKTIIFGLSL LIGSQLISAA QAQPAGTSSD TTLFTAADFY RVVFSHHPLV RQAALLNSEA 
QQEIMQARGA FDPKLFSTYD RKEFGQDLYY NKWQSGLTIP VLPAGIDVKL TYDRNMGQYV
NPEERVPSSG LAAVGVRVPV GQGLIIDARR NALRQAKLAV TLADAERLKL INKTLFDAAK
TYWEWYMAHQ QYRLIQNGYQ VADTRFRAIQ QRSLLGDAAA IDTTEALITA QDRLVQLQQA
EVNLQNARLR LSVFLWNSTD SDGMPQPVEL LPTVAPQPVP ADRLDESTLQ ALLSKAAERH
PELLKLTTKG QQLALEERFQ RSLLQPQLVL NANLLSRTPA AGVGYDWASY YAFRADNHKI
GVDLTFPIFL RKERGKLRQI QIKNQQVTLE RQQTGRAISN DVQAAWNELK ALERQIDVQQ
QTVRNQRILL GAEQQKFDIG ESSLFLVNSR ESKLIDLEIK LEELRTKQQK AVAALWYAAG
TNPEAQ