Gene Slin_4849 details

Gene Information

Locus tag: Slin_4849
Symbol:
ID: 8728613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5906187
End bp: 5908214
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 43%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003389626
Protein GI: 284039696
COG category:
COG ID:
TIGRFAM ID:
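
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5906187
end_bp = 5908214
gene_length_bp = 2028
protein_length_aa = 675

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions the three checks agree with the listed values: 2028 bp corresponds to 676 codons, one of which is the stop codon.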


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.895704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.860028
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATACG CCGATAAAGT ACGGACTTTC TTTTCTGGCA AAGTAGTTCG TAAAGACCTG 
ACAGCTCAAA TAAAAGGAAG TGCCGTGGTG CCGACTTATG TGCTGGAGTA CCTATTAGGT
CAATACTGTG CCTCAGATGA TGAAGTGATC GTTGCCGATG GTATTGAGCG CATTAAAGAT
ATTATCAAGT CAAATTTTGT GCATCGTAGC GATGCTGAAT TGGTTAAGTC GCGCGTACGT
GAGCAGGGTG AGCACCGAAT TATCGATAAG ATAAATGTGG TGCTAAATGA CAAGCAGAAT
CAATATGAGG CTGATTTTGC CAATTTAATG TTGCGCGATG TGCCTATTGT CGACCGCATC
GTGACCCAGC ATAAGAAGCT ACTAAGTGGG GCGGGGGTTT GGTGTATCAT CGACCTAAGC
TATGATCATA GCCGGGATGC TAAAGTACGC TGGGGCATTG AGTCAATAAA GCCTATTCAG
GTATCTGCCA TCGACGTACA GGAATATTTG GGTGCCCGGC CTCAGTTCAC TACAGACGAA
TGGCTGGATC TGTTGCTGCA TAGCATTGGG TATGAACCCC AATATTTTAG CCGTCGTGAT
AAATTTATAC AGTTATCACG ACTCATTCCT TTTGTTGAAA GCAATTTTAA CTTTATCGAA
CTTGGTCCAA AGGGAACTGG TAAATCTCAC GTCTTCTCAG AGCTATCTCC ACACGGCGTA
TTGGTATCAG GAGGTGATGT TTCTAAAGCA ACCCTATTTG TCAATAATAA TACAAGGCAA
TTGGGTCTAG TAGGCTTTTG GGATGTGGTA GCTTTGGATG AATTCGAGCA AGAAAAGAAC
AGCAAAACCA TTGATGGAGA CTTAGTAAAA ATTCTTCAGA ATTACATGGC CAACCAATCG
TTCAACCGGG GTAATGATAC CATCATGGCA ACCGCTTCCA TGGCCTTTGT GGGTAATACA
AAGCACACGG TGCCGTATAT GCTTAAAAAC AGTCACTTGT TTGAGTCCAT ACCCCAAGGT
TACCTAAAAG GGGCTTTCCT GGACCGGATG CATCTGTATA TTCCTGGTTG GGAAGTTCGA
ATTCTGAAAG AAAGCGTATT TAGCCATGGG CATGGCTTTA TTGTTGACTA TTTAGCGGAA
ATCCTGCGTG AGATGCGTCG GCTCGATTTT TCCGATTTAC TTTCATCGGT TGCAATTCTG
GATCCATCCC TAACCCAACG GGATAAATTA GCTGTTTTCA AAACGTTTTC GGGCTTAGCA
AAGCTGTTAT ATCCGCATCG GCAACTGACC GAAGAGGAAG CAGAAGAGTT AGTCGAATTT
GCAATGGAAG GCCGTCGCCG GGTTAAAGAA CAACTGTATC TTATTGATGA GACCTTCCAA
CAACAGCCAT CCCTGTTTCG ATACGTACAT AAGGCCACTG GCCAGGAGAG GGAAGTTGCT
TTGCTGGAAA GTCTAAGCCT GGACTCAACG TTCGAAATTC CAGACGTGAT TACGGTTGAT
GTCGACTCAG GAAGTGTAAA TACGACAAGT ACAAATCCTA CGACCAACAA ACTATTTATA
GGTGATAGGG CATTCAAGGA GAACCAGACC GGTTTGTCAT TTCGCAAGTT ATTCGGTGAT
TATATTGAAG GGGCTAAACA AATTTCACTG ATTGATCCAT ACATCCGTCA GCCTCATCAA
TACCGATTGC TTATGGAATT TCTGGTGTTA ATCTCCGAAA GAAAACCACT TGACCAAGAA
GTAGACGTTG AAGTAGTTAC CTACTTTGAG TCACCGGATA AAGAGATAGA AGCGAAAGCT
AACTTTGATC AGTTAACTGA ATCTGTTGCA GACCTAGGGA TTCAGTTAAC TTACCGCTTT
GATCCCGCAA TCCATGACCG GTTCATTTAT CTGAACAACG GCTGGCGAAT AAAATTAGGA
ATGGGATTGG ATATGTTTCA AAAGCCTGAC TTAATGGATA TAGCTAGTGT CTTCCCAGAA
AAGAGAAAGT GTAAAAAACG TTTTGAAATA AGTTATCAGC GTATTTAA
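
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before any computation such as GC content. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGGAATACG CCGATAAAGT ACGGACTTTC TTTTCTGGCA AAGTAGTTCG TAAAGACCTG"
seq = clean_sequence(first_line)
print(len(seq))                   # number of bases after removing whitespace
print(round(gc_percent(seq), 1))  # GC percentage of this one line only
```

Running the same functions on the full cleaned sequence would allow a comparison with the listed GC content, which appears to be rounded to a whole percent.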
 
Protein sequence
MEYADKVRTF FSGKVVRKDL TAQIKGSAVV PTYVLEYLLG QYCASDDEVI VADGIERIKD 
IIKSNFVHRS DAELVKSRVR EQGEHRIIDK INVVLNDKQN QYEADFANLM LRDVPIVDRI
VTQHKKLLSG AGVWCIIDLS YDHSRDAKVR WGIESIKPIQ VSAIDVQEYL GARPQFTTDE
WLDLLLHSIG YEPQYFSRRD KFIQLSRLIP FVESNFNFIE LGPKGTGKSH VFSELSPHGV
LVSGGDVSKA TLFVNNNTRQ LGLVGFWDVV ALDEFEQEKN SKTIDGDLVK ILQNYMANQS
FNRGNDTIMA TASMAFVGNT KHTVPYMLKN SHLFESIPQG YLKGAFLDRM HLYIPGWEVR
ILKESVFSHG HGFIVDYLAE ILREMRRLDF SDLLSSVAIL DPSLTQRDKL AVFKTFSGLA
KLLYPHRQLT EEEAEELVEF AMEGRRRVKE QLYLIDETFQ QQPSLFRYVH KATGQEREVA
LLESLSLDST FEIPDVITVD VDSGSVNTTS TNPTTNKLFI GDRAFKENQT GLSFRKLFGD
YIEGAKQISL IDPYIRQPHQ YRLLMEFLVL ISERKPLDQE VDVEVVTYFE SPDKEIEAKA
NFDQLTESVA DLGIQLTYRF DPAIHDRFIY LNNGWRIKLG MGLDMFQKPD LMDIASVFPE
KRKCKKRFEI SYQRI
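
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons), and it does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # "X" for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGAATACG CCGATAAAGT ACGGACTTTC"))  # MEYADKVRTF
```

The output matches the first ten residues of the protein sequence, and the final 18 bases of the gene (`AGTTATCAGC GTATTTAA`) translate to the closing residues `SYQRI` followed by a stop codon.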