Gene Slin_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2049 
Symbol 
ID8725787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2479518 
End bp2480885 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content52% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003386893 
Protein GI284036963 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.717116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGT CTTATTCGTT CACGCTTACA AGCCTTGGAC TGCTCTTGTC AGGAGCGGCC 
TGGTCGCAGT CGACGGCTCC TCAGTTGTCG ACTGTCGCTA CGCCCGTAAC AACGCTGAAC
TCCATACCAC CCTCAAATGC CGGACAAGCA GTTGGTACCA ACAAACCCGC CGAAACGTCG
GCAGTCACCC TAAAATTTAC CGGTTTTGTG CGAAACGATT TTTCGTTTGA TTCCCGCCAG
ACGGTCAACC TGCGTGAAGC TTCGGTGGAC TTATACCCAC GGGATAAGCA AGTTGACGTG
AATGGAGTGG ATGTGAATGC GGTCACAAAC TTTAACATGC TGGCCATCAA TAGTCGGCTG
GGTGCGGTGT TTACCGGCCC CGATGCGTTT GGTGCTAAAA CCTCGGGATT ATTGGAAATG
GAATGGTTTG GCCCCTCAGA TGCTGCCGTG GGCGGTGTTC GGTTGCGGCA CGCCTGGGCC
AAACTAGACT GGCCGAAACG ACAGTTGGCG TTTGGCCAGT TCTGGCACCC ATTGTTTGTG
CCTGAAGTAT TTCCTGGAGT GGTCAACTTT AATACGGGCA TTCCTTTTCA GCCATTTAAC
CGAAGCCCGC AGATTCGCCT TACCGAATAT CTAAGCAAAG ATGTCAGCCT TATTCTGGCC
TTGATTGCCC AACGCGATTT TACCAGCATC GGCATAAGCG GGAGCTCGTC TGAGTATATA
CGGAATACGG CGGTGCCTAA TTTACATGCC CAGTTGCAGG TAAAGAAAGG TCGGGTGGTG
GCTGGTCTGG CATTCGATTA CAAAATGATT CGGCCACGAC TTTCGACCGG CAGTGGTACG
TCTTTACTGG TTAGCAAGGC TACAGTGGGT AGTTCGGCTG TTATGGCGTA TTTGAAAGTA
GTTGGACGAG CCACTACGCT AAAGATCGAA GCGCTCAAAG GCTCGAATAT GACGGACCAT
GTCATGCTGG GCGGCTTTCT GGCCTATGGT ACGCCTGCGG CAGGTACTAC GCCCGCCCTC
GAAACAGCTT ACAAGCCAAC GGGTATTACG TCGGTATGGG CTGAGCTGAT GGGCAATGGC
AAAACCATTA TCCCGGCCAT TTTTGTCGGA TATACCAAGA ATACCGGAAA CGATCCCAAT
GCGGTGGCTG CGTACGGACG CGGCATTGGG ATTGGCGGAC GCGGAGGCAT CGATAATCTG
TTTCGGATAG CCCCCCGACT GGAAGTTGTT TCGGGCCGGT TTCGCGTTGG AACCGAGCTG
GAGCTAACTA CGGCTGGCTA CGGTACGTCG TCAACAGATG CCAGGGTTAC TGCTGCTGAG
CAAATAACGA ATACTCGTTT GTTGCTGACA ACAGTATATT CTTTCTAA
 
Protein sequence
MKLSYSFTLT SLGLLLSGAA WSQSTAPQLS TVATPVTTLN SIPPSNAGQA VGTNKPAETS 
AVTLKFTGFV RNDFSFDSRQ TVNLREASVD LYPRDKQVDV NGVDVNAVTN FNMLAINSRL
GAVFTGPDAF GAKTSGLLEM EWFGPSDAAV GGVRLRHAWA KLDWPKRQLA FGQFWHPLFV
PEVFPGVVNF NTGIPFQPFN RSPQIRLTEY LSKDVSLILA LIAQRDFTSI GISGSSSEYI
RNTAVPNLHA QLQVKKGRVV AGLAFDYKMI RPRLSTGSGT SLLVSKATVG SSAVMAYLKV
VGRATTLKIE ALKGSNMTDH VMLGGFLAYG TPAAGTTPAL ETAYKPTGIT SVWAELMGNG
KTIIPAIFVG YTKNTGNDPN AVAAYGRGIG IGGRGGIDNL FRIAPRLEVV SGRFRVGTEL
ELTTAGYGTS STDARVTAAE QITNTRLLLT TVYSF