Gene Slin_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2036 
Symbol 
ID8725774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2460051 
End bp2461265 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content52% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003386880 
Protein GI284036950 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.997579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0998645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAGC TGGAAACCGC CGACCGCCAG GCATACCGAT CATCTTCCGT ATCCATGCGG 
GTCGATGCCA TCGATATTCT CCGGGCCATG ACCATGATTC TGATGATTTT TGTCAATGAT
CTCTGGTCGC TGACCGCCAT TCCCGACTGG CTGGAACACG TGCCCCATGG CGTTGACGGG
ATTGGATTGG CAGACGTTGT CTTTCCGGGG TTTCTGTTCA TCGTGGGTAT GTCGCTGCCG
TTTGCCATGA ATGCCCGACG GCAGAAAGGC GATACGAATA GCGCCCTCGT TAGCCACATT
ATCATGCGTT CGATTGCCTT ACTGGTGATG GGCGTCTTTT TGGTCAATGG CGAGTCTATT
GACCAAAAGG CAACGGGAAT CAACCGGCTC GTATGGAATG TGCTGTCGTG CGGTTCGTTT
ATTCTGCTCT GGAATGCTTA TCCCAGAACA GCGGCAAAAT GGGTAGTTGC CGTCGCCAAA
GGCGTAGCTA TTGCCACGCT CATCCTTCTG GCATTTGTAT ATCGGGGAGA AGAGGAAAAG
CGGTTCGCGA CGCACTGGTG GGGTATTTTG GGTTTGATCG GCTGGTCGTA TCTGGCCGCA
GCTCTGGTAA CGGTATTTGC CCGTAATCGA ATTAGTATAT TACTGGCGGC CTGGGTTGGC
TTCAGTATGT TGAGTATGGT ATCGGCGGCC AACCTGCTGC CCGAATCCCT TTCGTTCATT
CCCAGTGCTA TTCGGGGAGG CACGTTGGTG GGGCTCACGA TGGGGGGCGT TGTGGTGTCT
ACACTCTTTG ATTACTTCCG GCGGCAAGCT GATCATAAGC GGATGACACT CATTTTTCTG
TTGGCGGCTG TCGCATTAAT TGGGGTATCC ATATACACCC GAACGTTTTG GGGCCTGAGT
AAGCTGGACG CTACGCCAGC CTGGCTGTTT TTGTGCAGTG CCTTCACCAT GCTTGCGTTT
CTGATAATCT ATTGGCTGGT AGACAGGGGA GGGAATGCCC GCAAACTCAA CTTCATAAAA
CCCGCCGGCA CGGATACCCT GCTCTGCTAC CTGATTCCCT ACTTTGCGTA CGCATTTACC
ACGTTGTTAA ATCTTCATCT ACCCAATGCG TTACTGACCG GCGGAATTGG CTTGTTAAAA
TCACTTCTGT TTGCACTGCT CTGTGTCTGG ATAACGGGCC TGTTAAACAA GGCAGGCGTA
CGGTTAAAAC TTTAA
 
Protein sequence
MTQLETADRQ AYRSSSVSMR VDAIDILRAM TMILMIFVND LWSLTAIPDW LEHVPHGVDG 
IGLADVVFPG FLFIVGMSLP FAMNARRQKG DTNSALVSHI IMRSIALLVM GVFLVNGESI
DQKATGINRL VWNVLSCGSF ILLWNAYPRT AAKWVVAVAK GVAIATLILL AFVYRGEEEK
RFATHWWGIL GLIGWSYLAA ALVTVFARNR ISILLAAWVG FSMLSMVSAA NLLPESLSFI
PSAIRGGTLV GLTMGGVVVS TLFDYFRRQA DHKRMTLIFL LAAVALIGVS IYTRTFWGLS
KLDATPAWLF LCSAFTMLAF LIIYWLVDRG GNARKLNFIK PAGTDTLLCY LIPYFAYAFT
TLLNLHLPNA LLTGGIGLLK SLLFALLCVW ITGLLNKAGV RLKL