Gene Slin_5462 details

Gene Information

Locus tag: Slin_5462
Symbol:
ID: 8729229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6645439
End bp: 6646539
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003390227
Protein GI: 284040297
COG category:
COG ID:
TIGRFAM ID:
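
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(6645439, 6646539)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Both values agree with the Gene Length and Protein Length reported in the record.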


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.674322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGCA GCCGTACCCG TATCATCGCT GTCGTTTTTT TCCTTTTCCT TTTCCTGCCG 
ACGGTCGATC AGCTACTTGG CCTGTCGTCC CGATTCAGCA GTACCGAGAA CAGGAAACTG
AATGGGATGC CCGCGCTGAA CTTTCCGCAC CTTCGCAGCT TTGTCAAACA GTTTGACCAT
TATTACAAAG AGAATTTTGG CTGGCGGAAT GCGCTGTTTT ATGTCTACAG CCGCTGGAAG
TTTAATATTC TGGGAGAATC GCCCCTACCC GAAAAGGTAG TTGTGGGTAA GAATGGCTGG
CTGTATCTGG GCAATAGCTA CAACAAAGTC ATTGACCAGC ACCGGGGTCT GCAACCGCTT
TCGCTGGATT CGGCCCGTCG GATTGCCAGC CATCTGATGC AGCGCCAGCA GGAACTGGCC
CGTCAGGGCG TCCAACTCTA CGTTCTGGTA GCTCCCGATT CGCACACCAT TTACCCCGAG
TACCTTCCCG ACCATTTACA ACAAAGCACC GCCCCATCGC GACTGGATGT TCTCAAGCAG
GCCATTAACC AGACTAACCT TCGCTTTGTC GATATTCGGG ATACGCTTCG GGCCGCCAAA
CGAGACCATG TGGTGTATTA CCAGACCGAT ACGCACTGGA ACGAATACGG AACCCTGATC
GGCAGTGCAT TCCTACTAAA CCGGATTCGG CAGGAGCAGC CCGCTATTCC TCCCGTTCGG
CTGTCGGATT ACCACATAGA AAAGCAATTG GGCGGGGCCG GTGACCTGAC CACCATGCTG
ACGCTTCAGG ATGAGCAGCG GGATACGATC TATTATTACA TAAAACCCAT CCCCAGCCGG
GCCGCACGGC AAACGGCCCA GATTCCGAAC GAAGAGACGG GGTACCCAGC CACCCGGTTT
TCAGGACCGG GCGCGGGTCG GCTGTTAGTC ATCGGCGATT CATTCAGTCA CGGGCTTATG
AACTACCTGC CCGGCTATTT TCGTGAATCC TATTTTATCC GGGGCCGCTA CCTGGACCCT
GCGGTTATAA AAGCGGAGAA GCCCACCGTT GTCGTCATTG AAGTCGTAGA ACGCAACATT
AACCAGTTAG CCACTTTTTA G
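
The gene sequence above is laid out in space-separated blocks of 10 bases. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_content` are hypothetical, and the short input is only the first three blocks of the record, used for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so spaces and line breaks are removed together.
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, as an illustration only.
fragment = clean_sequence("ATGAATAGCA GCCGTACCCG\nTATCATCGCT")
print(len(fragment))                  # 30
print(f"{gc_content(fragment):.0%}")  # GC percentage of this fragment only
```

Applied to the full 1101 bp sequence, the same function gives a value that can be compared with the GC content field of the record.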
 
Protein sequence
MNSSRTRIIA VVFFLFLFLP TVDQLLGLSS RFSSTENRKL NGMPALNFPH LRSFVKQFDH 
YYKENFGWRN ALFYVYSRWK FNILGESPLP EKVVVGKNGW LYLGNSYNKV IDQHRGLQPL
SLDSARRIAS HLMQRQQELA RQGVQLYVLV APDSHTIYPE YLPDHLQQST APSRLDVLKQ
AINQTNLRFV DIRDTLRAAK RDHVVYYQTD THWNEYGTLI GSAFLLNRIR QEQPAIPPVR
LSDYHIEKQL GGAGDLTTML TLQDEQRDTI YYYIKPIPSR AARQTAQIPN EETGYPATRF
SGPGAGRLLV IGDSFSHGLM NYLPGYFRES YFIRGRYLDP AVIKAEKPTV VVIEVVERNI
NQLATF
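
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in the permitted start codons), and it assumes an unspliced CDS read in frame from its first base. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases and last 12 bases of the gene sequence.
print(translate("ATGAATAGCA GCCGTACC"))  # MNSSRT
print(translate("GCCACTTTTTAG"))         # ATF
```

The two outputs match the first six and last three residues of the protein sequence listed above.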