Gene Slin_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0020 
Symbol 
ID8723748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp17739 
End bp18857 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content50% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003384893 
Protein GI284034963 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.641333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCATTC TAAAAAATAC GCTTCTCATT CTGAGCGGGC TTGTCCTTTC CCTGACGGCT 
CCGGCGCAAG TGCTGACCGA TCCGGCGGTT CAGCAAACGG TTCTGAAAAC ACTGGACAAT
ATTTATAATC TCGACTTTGC CGAATCCGAT GTACAGATTC GACAGATTCA GAGCCGGTTT
CCTCAGCACC CGATCGGGCC TATTTTGCGG GCTACCGAAC TGGAACTTCA GTACCTGCCA
GTGCATGAGA ACAAGGCGGC ATCTGTGCAG TTCATACAGG CCGTAGAGCA GGGATTGGCA
CTGGCGAAGA AAATGCTTGA TAAGGACGAG AACGACCCGG AAGGTGTGTT CTTTGCGTTG
ACGGCGCATA GCTATCTGGC TTCTTTTTAC AACAATAAAA ATGAATCGCT GAAAGCCGTT
GGCGAGTCCA AAAAGGCGTA CAACTACCTT CGGGATGGCT TTGTGCTGAT GAACAAAACG
CCGGATTTCT ATTTCACAAC CGGACTTTAT AATTACTATA TAGAACGCTA CCCAATGGAT
CATTCCATTG TCAAGCCGTT TCTGGTTTTC TTTGAGCGTG GCGACATGGC GAAAGGACTG
AAGCAGATGG ACGTAGCCGC CCAAAAAGCC ATTTTCCTGC GTCCGGTAGC CAACTATTAT
CTGGCGCATA TTTTAGTGAA GCACGAAATG AGTCCCAGCC GGGCGGTTGT TTATGCTAAA
TCGCTAGCCG ATAAGTATCC CAATAATCCA TTATTTGGCA TGCTGGGGGC CGAATCACTC
CTGCTGGCAG GCCGATACAA TGAAGCCCGC CCGTACGTAC AACGGCTAAA ACACATGTCC
AACAAACTGG TGCCTATGGC CGTGCATACG TTTAGCGGGA TGCTGGCCGA GTATGCCGAT
AAAAACGATG TGGCAGCAGC TGAATCCTAT GAAACAGCCC TCCGCCTGCC GTTCAATGAA
CCATATACCA AAGAGTACCA CGCGTTTGCC TATGCCGGTT TAGCCCGAAT TGCTGCCCGC
GCCAACGATA CAAACCGGGC TCGTATTTTC TACAAAAAAG CACTTGCCGC AGGTCAGTAC
AAAGCGCTGA TCCGCGAGGC CAAAGCGTAT AAGGGTTAG
 
Protein sequence
MSILKNTLLI LSGLVLSLTA PAQVLTDPAV QQTVLKTLDN IYNLDFAESD VQIRQIQSRF 
PQHPIGPILR ATELELQYLP VHENKAASVQ FIQAVEQGLA LAKKMLDKDE NDPEGVFFAL
TAHSYLASFY NNKNESLKAV GESKKAYNYL RDGFVLMNKT PDFYFTTGLY NYYIERYPMD
HSIVKPFLVF FERGDMAKGL KQMDVAAQKA IFLRPVANYY LAHILVKHEM SPSRAVVYAK
SLADKYPNNP LFGMLGAESL LLAGRYNEAR PYVQRLKHMS NKLVPMAVHT FSGMLAEYAD
KNDVAAAESY ETALRLPFNE PYTKEYHAFA YAGLARIAAR ANDTNRARIF YKKALAAGQY
KALIREAKAY KG