Gene Slin_5993 details

Gene Information

Locus tag: Slin_5993
Symbol:
ID: 8729774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 7268634
End bp: 7270070
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: Protein of unknown function DUF1800
Protein accession: YP_003390754
Protein GI: 284040824
COG category:
COG ID:
TIGRFAM ID:
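
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a gene length that includes one stop codon; both are inferred from the numbers in this record rather than stated by it, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(7268634, 7270070)
print(length)                  # 1437
print(protein_length(length))  # 478
```

The check only applies because the record lists the gene as not spliced; a spliced gene would need its exon lengths summed first.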


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAAC TGACCAGACA GCAACAGACC CGTCATTTGT TTGCACGAGC GGCTTTTGGG 
GCAGCCCCCG CCGAATTGGA AGAAGCATCG CGTAAACCCC TTCGAAAAGT AGTGCGCCAG
TTGTTTAAAA ACAGCGAAGC CGTGACCGAT CTGAAAGCTG TTGAGCCCGA CGAAAACGAA
TCGAAGAAAC AATTGAAAGG GCTTCTTCGG CAGGGTGAAC TCGACAAAGA AATGCTTAAA
GAGCGAATCC GCGACAATGC CGAAAAAGTG CGGGATCTAA ACCTGTTGTG GATTGACCGA
ATGGCAATCG GCAACGGCGC ACTGCGTGAG AAGATGGCGC TTTTCTGGCA TGGGCACTTT
GCCTGCCGGG CGCAGGGCCG GAATCCGCTG CTCATGCAAC AGTATGCTAA TACACTTCGG
CAAAACGCTC TGGGTAAGTT TGGCGATCTG TTGATGGCGG TCTCCAAAGA GCCAGCAATG
CTACAATTTC TGAACAACCA GCAAAATCGA AAAAACGCCC CGAATGAGAA CTTCGCCCGT
GAAGTGATGG AACTCTTTAC CCTTGGACGG GGCAATTATT CGGAACACGA TATTAAGGAA
GCGGCCCGTG CCTTTACCGG CTGGCAATTT ACGCCCGAAG GCCAGTTTGT TTTTCGGGAG
CGGGTTCATG ACGAAGGCGA GAAGACCATT TTTGGTAAAA CGGGTTCCTT CAAGGGCGAA
GATGTAATTG GTATGCTCCT CGAAAACCGA CAGACCGCCC GCTTCATCAC GGCTAAGGTT
TACCGGTTCT TCGTTAACGA AACCGAAGAT AGAAAGCGGG TCGATGACTT AGCCGACCAA
TTTTACAAGA GTGGCTATGA TATTACGGAC CTGATGGAGA GCATCTTCAC CGCCGACTGG
TTCTACGATC CCAAAAACAT TGGTGCCCAT ATTAAATCGC CCGTTGAGTT ACTGGTTGGG
CTGCGTCACA CATTGGGCGT CCGGTTCGAC CAGCCCCAGC CGCAGATTTT CGTCCAGCGA
ACGCTGGGTC AACTGCTGTT TTACCCACCC AATGTTGCCG GTTGGCCGGG CGGTAAAAAC
TGGATCGACT CATCGAGTCT GCTGTTTCGG ATGCAGCTAC CGAGCTACGT TCTCAAAGCC
GCCGATGTGC TGGTGCGCCC CAAAGAAGAC GGTGATGTGA ATACCCAGCT ACTGGCCCGC
AAAGGAAACG CGAAGTTTCG CACGACCGTC GACTGGGCCG ACTTTGAGAA GGCCTTCACC
AAAACACCCG ATGCTGATCT GCCGGATGCG CTGGCAGTAA CGCTGCTGCC CTTTCCGTTG
CGTCCGGACC AGCGAACTGT ACTGGAGAGT CAGCTAAAAC CTGACCTGAC CCGTCCCGAA
CGCATTCACA CGCTAACGGC CGCCATCATG AGCCTGCCGG AGTATCAATT AACATAG
 
Protein sequence
MDKLTRQQQT RHLFARAAFG AAPAELEEAS RKPLRKVVRQ LFKNSEAVTD LKAVEPDENE 
SKKQLKGLLR QGELDKEMLK ERIRDNAEKV RDLNLLWIDR MAIGNGALRE KMALFWHGHF
ACRAQGRNPL LMQQYANTLR QNALGKFGDL LMAVSKEPAM LQFLNNQQNR KNAPNENFAR
EVMELFTLGR GNYSEHDIKE AARAFTGWQF TPEGQFVFRE RVHDEGEKTI FGKTGSFKGE
DVIGMLLENR QTARFITAKV YRFFVNETED RKRVDDLADQ FYKSGYDITD LMESIFTADW
FYDPKNIGAH IKSPVELLVG LRHTLGVRFD QPQPQIFVQR TLGQLLFYPP NVAGWPGGKN
WIDSSSLLFR MQLPSYVLKA ADVLVRPKED GDVNTQLLAR KGNAKFRTTV DWADFEKAFT
KTPDADLPDA LAVTLLPFPL RPDQRTVLES QLKPDLTRPE RIHTLTAAIM SLPEYQLT
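
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. A minimal sketch, assuming plain whitespace is the only separator; the helper names are hypothetical and the GC calculation is a simple base count, not necessarily the method used to produce the GC content value in this record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence shown above.
fragment = join_blocks("ATGGATAAAC TGACCAGACA")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 40.0
```

Passing the full gene sequence text through `join_blocks` should yield a string whose length can be compared against the listed gene length.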