Gene Slin_4642 details

Gene Information

Locus tag: Slin_4642
Symbol:
ID: 8728406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5652264
End bp: 5654669
Gene Length: 2406 bp
Protein Length: 801 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: protein of unknown function DUF214
Protein accession: YP_003389419
Protein GI: 284039489
COG category:
COG ID:
TIGRFAM ID:
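
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes the stop codon. Neither is stated in the record, but both reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length counts the terminal stop codon, which is not
    translated into an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 5652264, End bp 5654669.
length = gene_length(5652264, 5654669)
print(length)                  # 2406, matching "Gene Length"
print(protein_length(length))  # 801, matching "Protein Length"
```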


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000162597
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCGCA ACTACCTCAC TATCTCGCTC AGAAATCTAT GGCGTAACCG GAAAGTTACG 
CTGATCAGCA CCGTCGGACT TTCCATCGGG CTGGCTTGCG GCCTTGTTAT CTTTTTGCTG
GTCAGCTACA TGTTCAGCTT CGACCGCTAC CATACCAAAG CTGACCGAAC CTACTGGGTT
GTTACAGACA TTCGGCAGGA GAACGTTGTG CCAACGGATG CCACGCCCCG GCCGATGGGC
GATGTGCTTC GGGAGGAGCT TCCATTTGTG GAAACGGCGG CCCGGCTCGA AAACAGCCCC
AGTCGGGTCA TGGCGGTACC CGATGGGAAA GGCGGATTTT CGAAGAAATT TGACGAATCG
CGTAGCCTTT GCTTCACCGA ACCGCAGTTC TTCAGCGTGT TCGATTCGGA CTGGTTAAGC
GGCAATCCAG AAACCGCTCT GGCAGCCCCG AACACGGTTG TTCTGACGGA ACGGTACGCT
CAAAAATACT TTGGTTCAGC CAATCCAATG GGTAAGGTGC TGCGCTTCGA TAACCAGACC
GACCTGACCG TTACAGGACT TATCAAAAAC CTGCCGTCCA ACACCAAGCT CCGGTACGAC
GCTTTCATTT CCTACGCAAC CGTGCCTACT CTTTCGGGCG GAGGAGGCCA ACAGGCCATG
CAGGACTGGA GCCGGGTATT TACCGTATGT TTCGTCACCC TCCGCCAGGG CACGCCCGTA
GAACGGCTGC TCGATGCCTT TCCGGTTATC CGGAAAAAGT ACCTGACCAC GCCCGAAGCG
AAAAAACTGG ACTTTCATGC CATTCCACTC CCGGACCTGG AGCATATGCC TCAATACGGT
GGCCGGTCGC CAGGGCTCAT TTTGTACACC CTGATTATTG TCGGGCTGTT TCTGGTGCTG
GCGGCCTGTA TCAATTTCAT CAACATTGCC ACCGCTGGTG CCCTGAAACG CGCCAAAGAA
GTGGGCGTTC GGAAAGCGGT GGGCAGTTCG CGAGGGCAAC TCATCGGGCA ATTTATGATT
GAAACAACGC TGGTTACGCT GGCGGCTGTT GCACTGGCGA TGCTGCTGGC CCACCTCTGT
TTGCCCATGC TGAACAGTGT CCTGTCTGTT ATGCACACCG ATATCTCCAT TACAAACCTG
TTCCATCCCG ACTCGCTGGT CTGGTTTGTC GCGTTGCTTG TTGGCGTTAT TCTATTGGCT
GGCTTGTATC CCTCGCTGGT GCTGGCCCGT TTTAATCCGG TAGCGGCCCT GCGCGGGCGA
CTCAGCACGC AACAGATTGG CGGGGTATTC GTTCGGCAGG GGCTGATCGT CACGCAATTT
TTTATCACCC AGCTCTTTAT CATTGGCGTG GGCGTCATGC TGGCGCAGGT ACGGCACATG
CAGCAGGCCG ATCTGGGGTT TCAGAAAGAA GCGATTTTGA CGGTGCCGGT GCCCGTTAGC
AATGCCCTTA AGCAGGATGT TGTTCGTGCC CGGATGGCAC AGATAGCGGG TGTCGAAGCC
GTGTCATTAG GTGCCGACCC GCCCGCAACC TACCGGCGAC TGCCCGTGCC GTTCACCTAC
GACACCCACA CGCAGCCGGA AAAATTCCCC ACGGTGGTTA AAGTTGGCGA CAAAAACTTT
GTGTCGCTGT ACGGAATCAG GCTGCTGGCC GGGCGCAACT TCCGAACCAA CGATACGACG
AACAACGAAG CCCTCGTGAA CGAAACAATG GTAAGAGAAT TGGGGCTGCG CTCGGCCCGC
GATGTACTCG GCAAACGCGT TAACCTGTGG GGTGGCGACA AAACGATTGT GGGTGTCGTG
CGCGATTTTC ACCTGAGCGA TTTACACCAG GGTATTCCGC CCGCCACCAT TCTGAACTAC
TACCGCGAGA ACCGAATGGC CTCACTAAAG CTCAATCCAA CCGATATACC GACAACCCTG
AAGGCAGTTG AAAGTACCTG GAATGAGTTA TTCCCGGAAC AGGTATTCAA AGCCAATTTC
GTCGACGATC TGCTGGCTAA CTTTTACATC ACCGAACACG TCCTGCTGGG GCTGGCCGAA
GTGTTTTCGC TCATTGCCGT TCTGCTCAGT TGCCTGGGTT TATATGGTCT GGTAACGTTT
ATGGCCGAAG CGAAGACGAA AGAGATTGGT GTTCGGAAAG TGCTGGGGGC CACTCCTGCT
CAACTTGTGT GGTTGTTTGG TCGCGAGTTC AGCCGACTCG TTCTGCTTGG CTTTGTACTG
GCGGCTCCGC TGGGCTGGTT TCTGATGAAC GGCTGGCTAC AGGGGTATGC CTACCGCATT
AATTTCAGCG GCTGGCTGCT GGCCGCAACC CTCGTTATAG CCAGCCTGAT TACGGCCCTG
ACAGTTGGCT ACGAATCGCT GAAAGCCGCC CGCATGAACC CGGCAAAAAG CCTTCGAAAC
GAGTGA
 
Protein sequence
MFRNYLTISL RNLWRNRKVT LISTVGLSIG LACGLVIFLL VSYMFSFDRY HTKADRTYWV 
VTDIRQENVV PTDATPRPMG DVLREELPFV ETAARLENSP SRVMAVPDGK GGFSKKFDES
RSLCFTEPQF FSVFDSDWLS GNPETALAAP NTVVLTERYA QKYFGSANPM GKVLRFDNQT
DLTVTGLIKN LPSNTKLRYD AFISYATVPT LSGGGGQQAM QDWSRVFTVC FVTLRQGTPV
ERLLDAFPVI RKKYLTTPEA KKLDFHAIPL PDLEHMPQYG GRSPGLILYT LIIVGLFLVL
AACINFINIA TAGALKRAKE VGVRKAVGSS RGQLIGQFMI ETTLVTLAAV ALAMLLAHLC
LPMLNSVLSV MHTDISITNL FHPDSLVWFV ALLVGVILLA GLYPSLVLAR FNPVAALRGR
LSTQQIGGVF VRQGLIVTQF FITQLFIIGV GVMLAQVRHM QQADLGFQKE AILTVPVPVS
NALKQDVVRA RMAQIAGVEA VSLGADPPAT YRRLPVPFTY DTHTQPEKFP TVVKVGDKNF
VSLYGIRLLA GRNFRTNDTT NNEALVNETM VRELGLRSAR DVLGKRVNLW GGDKTIVGVV
RDFHLSDLHQ GIPPATILNY YRENRMASLK LNPTDIPTTL KAVESTWNEL FPEQVFKANF
VDDLLANFYI TEHVLLGLAE VFSLIAVLLS CLGLYGLVTF MAEAKTKEIG VRKVLGATPA
QLVWLFGREF SRLVLLGFVL AAPLGWFLMN GWLQGYAYRI NFSGWLLAAT LVIASLITAL
TVGYESLKAA RMNPAKSLRN E
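
The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the pipeline that produced the record. It assumes the sequence blocks are plain DNA with whitespace between groups of ten bases, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits). The names `translate` and `gc_content` are illustrative.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
fragment = "ATGTTCCGCA ACTACCTCAC TATCTCGCTC"
print(translate(fragment))  # MFRNYLTISL, the first 10 residues of the protein
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content field listed in the record.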