Gene Slin_3038 details


Gene Information

Locus tag: Slin_3038
Symbol:
ID: 8726790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3687128
End bp: 3688702
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003387848
Protein GI: 284037918
COG category:
COG ID:
TIGRFAM ID:
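The length fields above are consistent with the listed coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3687128
end_bp = 3688702

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1575
print(protein_length)  # 524
```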


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.110831
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATCTC TCAGAACCGG TTTGCTTCTT GTGCTTTTTC TGGCCATAGC TTCGTTTTCG 
ATTGCTCCAC ACCATAAAAC AGGCATTACC TCTGCCAGCC CTGCATCACT AAGACTTGGT
CCGGCTCCCA CTGGCCAGCA ACTGGCGCGT CAATACTGTG GAAACTGCCA CCTGTTTCCG
GAACCAGAAC TGCTGGACAA AAAAACCTGG GTTACGGGGG TGTTGCCTAC TATGGCACTT
CGCCTTGGCC TTAAACTACC TGGGCAAGAC ACCCTAAGGC CTTTAGCACC CGAAGAGGAA
AAAGCTATTA GCCAACTCCA TGTCTATCCC GAAACGGCAG TCTTGTCCCT GGCCGATTGG
GAGAAAATCG TTCGTTATTA TGAACAGACA GCCCCCGAGG CACCTCTACC ACAAAAATCC
CATGCACCCG TAATCAACGA ACTGCCCCTT TTTGCGGCCA AAGAAGTGGC AATGGGCGAT
AAACCCGTGC CGCAAACAAC GCTACTGAAA TATGATAAAC AAAGCGGGCA ACTCTATGTA
GGCGATGCCC AGCACGCCTT ATACGTCCTC GACAAACAGC TTCAACTGGC CGATACGTGG
TGGATCGACA CCCCGCCAAC GGATATTGAC TTCCCAAAAA ATGCAGCCCC AAGATTACTC
ACCATTGGTA TATTCAGCCC GTCAGACCAG CAACTGGGTC GACTCATGAC GCTTGAAAGG
GCCGCTAAAC CGGGCGCTAA GCCAGTTGAT ATTCGGCAAT TACCACGCCC GGTACAGTTT
GTGACCAGCG ACCTGAACAG TGACGGTAAA GAGGATGCGG TTATCTGCGG ATTCGGCAAT
AATGCCGGAA AACTATTCTG GTATGACGGT TTCGATCCGG CTAAAGAACA TGTTTTGAAA
GCTTTCCCCG GCGCAAGAAA GGTGGAAATT AACGATTTCA ACCATGACGG CAAGCCCGAC
ATTCTGGTTC TGATGGCCCA GGCGCGGGAA GAGGTCTCCA TCTTTTATAA TCTGGGAAAC
GGGAAATTTA AAGAAAAGGT CGTACTCCAA TTCTCCCCGC TATTTGGCTC CAGTTACGTT
GAACTGGTTG ATTTCAATAA AGATGGTTTT CAGGACCTAC TATTGACCAA TGGCGATAAC
TGGGATTATT CGTCCGTCAA TAAAAACTTC CACGGCATAC GTATATATCT CAACGACCGG
AAGGATCATT TTAAAGAAGC CTGGTTTTAC CCCATCTATG GCGCGGCAAA GGCAATAGCC
CGCGACTTCG ATAAGGATGG CGACCTCGAT ATTGCCGCTG CTTCTTTCTA CACCAATCTG
GACCAGCCAG AACAGGGGTT CATCTACTTT TCTAATCAGG GCGGTCTGAT GTTTACGCCC
TTCAGTACGC CAGCGGGTGC CGACGGGAAA TGGCTAACTA TGGAAGCCGC CGATGTCGAT
CAGGATGGTG ATTTGGATAT TGTGCTGGGT TCTTACTTTC ACACGGTTGG CGAAATGACT
CAGTTGTTAT TTAAAGGTAT CACCTCTTTT CCACAACTGC TCGTGCTGGA AAACCAGGGG
AAGAAACTAC GCTGA
 
Protein sequence
MPSLRTGLLL VLFLAIASFS IAPHHKTGIT SASPASLRLG PAPTGQQLAR QYCGNCHLFP 
EPELLDKKTW VTGVLPTMAL RLGLKLPGQD TLRPLAPEEE KAISQLHVYP ETAVLSLADW
EKIVRYYEQT APEAPLPQKS HAPVINELPL FAAKEVAMGD KPVPQTTLLK YDKQSGQLYV
GDAQHALYVL DKQLQLADTW WIDTPPTDID FPKNAAPRLL TIGIFSPSDQ QLGRLMTLER
AAKPGAKPVD IRQLPRPVQF VTSDLNSDGK EDAVICGFGN NAGKLFWYDG FDPAKEHVLK
AFPGARKVEI NDFNHDGKPD ILVLMAQARE EVSIFYNLGN GKFKEKVVLQ FSPLFGSSYV
ELVDFNKDGF QDLLLTNGDN WDYSSVNKNF HGIRIYLNDR KDHFKEAWFY PIYGAAKAIA
RDFDKDGDLD IAAASFYTNL DQPEQGFIYF SNQGGLMFTP FSTPAGADGK WLTMEAADVD
QDGDLDIVLG SYFHTVGEMT QLLFKGITSF PQLLVLENQG KKLR
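
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one possible way to do this, not the method used to generate this record. It assumes the sequence is pasted as text with the 10-character blocks separated by whitespace, and it relies on the fact that translation table 11 uses the same codon assignments as the standard code during elongation (special handling of alternative start codons is not modeled). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGCCATCTC TCAGAACCGG TTTGCTTCTT GTGCTTTTTC TGGCCATAGC TTCGTTTTCG"
print(translate(first_line))  # MPSLRTGLLLVLFLAIASFS
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the protein sequence and the GC content reported in the record.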