Gene Slin_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4420 
Symbol 
ID8728180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5356423 
End bp5358339 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content56% 
IMG OID 
ProductRNA-binding S4 domain protein 
Protein accessionYP_003389200 
Protein GI284039270 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAG ATTCAGAACA GAACGACCGA AACACCGGCC CAGGCCGTGA TGGTGATTCG 
AACCGGCAGA GCGGTCGTTC CTTCGGCCGT CGGGATGACG TCCGCAACAG TGGCACAAAC
TCGTCACGCC CTAACCGGGA TTCAGATCGC CCCCGTTTTA GCCGGGATAG CGACCGGAAC
GACCGGCCAA AATTTAACCG GCCTGCCGAC GGCGACCGGC CCCGTTTCAG CCGCGATGAA
CGTAACGACG GACCGCGTAA TGACGGACCA CGTAACGGCG GACCGCGCGA CGGACAAAGA
GGAGGCAACT CCCGCGATAC GGGCCGCCCT TTCCGTAATG ACGGACCACG TACCAATGAC
CGTAACAGCG ATCAGGGCAA ACCTCGTTTT GACCGCAATT CAAACGACCG CCCTTCTTTC
AACCGGGATT CGGAGCGCCC ACGCTTCAGC CGGGATAATG ACCGTTCTTC ATCCAGTCGG
GATTCCGACC GTCCGCGCTT TAATCGGGAT AACGACGGGG GGGATCGCCC GAAATTCAAC
CGGCCAGCCG ATGGCGACCG GCCCCGTTTT AACCGCGATG ACCGGACGTA CCGCCCTGCG
GGAGCCGGTC CCCGTCCTGA TTCTGCATCA CGCCCTAATC GGGACGACCG ACCGGAGCGA
CCCCGTCGGG ATACCGATTC GCCACGATTC TCAAACGATA AAGGCCGTAG CAATGATCGT
TCGGACAACG ACCGGCCCAG ACGTGACGAC CGCCCTTCTT TCAACCGGGA TTCGGAACGC
CCGCGCTTCA GCCGGACGGA TGCACCGCGT GATACGAACC GTGAGTCGAA AAGTGACGGA
CCGGCGCGTT TCCCTCGGGA GCGTAAAAGT ACGGGTGGGT TTGACCGCCC CGAAAAGCCC
GCGTTCAAAC GGGTAGGTGG TTTTAGCCGC GAAGCTGACG AACGTAATAA TTTTGGCGGT
GAAGACCGTC GCGGTGAAGA CCGTCGTGGC AACGACCGTC GGAGCAATCG TGACGAAGAG
TCTGGTTTCA CCGGCCGTCA ACGTAAAGAG TCCGGAGATC GGCGCACAGG AAATTTTACT
AAAGCACCGG ATTATAAGCT GGAGCAGGTA CGGGCAAACC AATTTGCCAA ACGAAGCCGT
CCGGATGATC GGCGTGGTAG CCAGGATAAC GATTCGGAGA AGGGAAAACG TACGCCCGCC
AATGATGGAA CAACCCGTTT AAACCGATTC ATTGCTAACT CGGGCGTTTG TTCGCGCCGG
GAGGCCGACG AGCTTATTGC CCACGGTGAC ATTTCTGTAA ATGGCAAAAT CGTTACTGAA
ATGGGCTATA AGGTAAAGGA GGGCGATACC GTCAAATACG GTACCAAAGT CCTGAACCCC
GAACGGTTCG TCTATGTGCT GCTCAATAAG CCTAAAGATT ATATTACCAC AACTGAAGAT
CCGGAAGAGC GTAAAACGGT GATGGAACTG GTAGCCGATG CGGGTAAATT CCGGATGTAT
CCGGTAGGAC GCCTGGACCG GAACACCACC GGGTTGCTGT TGATTACCAA CGATGGCGAA
CTGGCCGATA AACTGACGCA CCCATCGAAC AACATTCGTA AAATTTACCA GGTTGAGCTG
GACAAACCCA TCACCGATGA ACATTTCGAA GCCATCAAAA AGGGCATCGA ATTGGAAGAT
GGCCCCATTA AACCCGACGC GATCAGTATT GTTACGCCGG ATGCCTATGT GGTCGGTATT
GAAATTCACT CGGGACGTAA CCGCATTGTG CGCCGTATTT TTGAAAGTTT CGGGTACGAA
GTAACAAAAC TCGACCGCAC AACCTACGCT GGTTTGACCA AGAAAGAGCT GCCCCGCGGC
AAATGGCGCT TCCTCGACCC GAAAGAAGTC GTGAAACTGA AATATCTGAA CGCATAA
 
Protein sequence
MSQDSEQNDR NTGPGRDGDS NRQSGRSFGR RDDVRNSGTN SSRPNRDSDR PRFSRDSDRN 
DRPKFNRPAD GDRPRFSRDE RNDGPRNDGP RNGGPRDGQR GGNSRDTGRP FRNDGPRTND
RNSDQGKPRF DRNSNDRPSF NRDSERPRFS RDNDRSSSSR DSDRPRFNRD NDGGDRPKFN
RPADGDRPRF NRDDRTYRPA GAGPRPDSAS RPNRDDRPER PRRDTDSPRF SNDKGRSNDR
SDNDRPRRDD RPSFNRDSER PRFSRTDAPR DTNRESKSDG PARFPRERKS TGGFDRPEKP
AFKRVGGFSR EADERNNFGG EDRRGEDRRG NDRRSNRDEE SGFTGRQRKE SGDRRTGNFT
KAPDYKLEQV RANQFAKRSR PDDRRGSQDN DSEKGKRTPA NDGTTRLNRF IANSGVCSRR
EADELIAHGD ISVNGKIVTE MGYKVKEGDT VKYGTKVLNP ERFVYVLLNK PKDYITTTED
PEERKTVMEL VADAGKFRMY PVGRLDRNTT GLLLITNDGE LADKLTHPSN NIRKIYQVEL
DKPITDEHFE AIKKGIELED GPIKPDAISI VTPDAYVVGI EIHSGRNRIV RRIFESFGYE
VTKLDRTTYA GLTKKELPRG KWRFLDPKEV VKLKYLNA