Gene Slin_3301 details


Gene Information

Locus tag: Slin_3301
Symbol:
ID: 8727054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3991505
End bp: 3993265
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003388111
Protein GI: 284038181
COG category:
COG ID:
TIGRFAM ID:
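
The start, end, gene length, and protein length fields in this record are mutually consistent. The minimal Python sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 3991505
end_bp = 3993265

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1761
print(protein_length_aa)  # 586
```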


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0137891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTTC GTCACCTTGT CCTGAGTTGT TTACTCGTTC TTCCAATTCT GCTTCAAAAC 
TGCGCGCAGG TTGCTCAGCC GCCGGGTGGT AAGAAAGATA CGCTGGCCCC GAAACTGGTA
AGCAGTCTAC CCACTCCCCG CCAATTGAAT TATACCGGCA AAACCGTTGA ACTGGAGTTC
GATGAATACG TCAACAGCGA GAATCTTCAG CAGAAAATAA CGATTACACC CCAAGATAGT
AATACGTTTA TCGTGAAGTC GCTGCCCTTA GGCATCCGGC TGAGCTTTAA CAAACCTTTT
CTGCCTAATA CGACCTACAC GATCGACTTT GCCGATGGCA TCAAAGATAT TACCGAACGA
AACATTGCCA AAAATACCAA AGTAGTATTT AGTACGGGGC CTGTTATCGA CTCACTTTAC
CTAACGGGAA ATGTGGTGGA CGACGAAAGT CGCGAACCCT TGCTCGGTTT TGTCGTGGGT
CTCTTTGCCT CGACGGATAC GTTGCCCATC AACCGAAAAC GCCCGCAGTA TTTTGCCCGG
ACAGACAGTA ACGGCAATTA CCGGATCGAA AACGTAAAAG CCGGACTGTA TAGAGTCTAT
GGTTTTGACG ATAAGGATTT GAATCTGGTG AACAATACGC CGGGCGAGCG GATCGCTTTT
CGGGATAGCG TCCTGAACCT GAATCGAAAT TATACGGGTA TTGATCTGGT GGCCTTTCGC
GGCTATGGTA AACCCCGAAT CAGTAGGCGG GAGCGCACCG ACGAAACACT CGGGCTGGAA
CTCAGCAGTG GTATTGCCAG CTACAAACTT CGGTACGGCA GACCCGTTTC CACGTCGGCC
GTGTCCACGT CGGCCGTGTC CACGACGGCT GTGTCCACGA CGGCTGTGTC CACGACGGCT
GTGTCCACGA CGGCTGCATC CACAACGGCA TCGACAGGAA GTTATTCGGG CGATACGCTG
ATTTCGTTTC TGGAGACTCC GAAAATGATC CGACTTTTTC GGCCTGCCAA CCGGGCGGCT
GATGACACCC TGCACCTAAC CGTGGTGGCC GAGGATTCGG TGGGCAATGT GACGGAACTG
AGGGAGCGGA TTTATTTCTC TCCGCTGAAG ACGAAAGCGA AGAACCGAAG TCCGCTAACG
GTTCAGGTAA GCCCGCCATC GAACGAACCC ATTGACAATA ACCTGGAGTT TACGCTGGTC
TTCAGCAAAC CCGTCTTTAG TTATAAAACG GAGCGGATAA TTATTGGGCC CGATAGTACG
AAACCGTTTG TTCTTACCCC TGCCAACCTG ACCTGGACGA ATAATAAGTC CCGCCTGGTT
ATCAGGCAGA AAACCAACCT GACCGATACG CTGCTGTTTC GACTGCAAAA AGGGGCCTTT
ATCAGTGTTC AGGGCGATAC GCTTGCCCGC TATTCGGCCC GCTACACCAT TGCCGAAGAA
GATAGCTACG GACTCATTGC GGGGCATGTG AATCCGGCTA CTACGGGAGC CGCCGGGAAT
AAGTTTATCG TCGAACTGCT GGATGATAAG TACACCGTAG TCCGATCTGC TTACGGAACC
CCGTCCTATA GTTTCGGTCG ATTGAAACCG GGGTTGTACC GTGTTCGACT GATTATTGAC
GCCAACGGGA ACCGAAAGCG CGACATCGGC AATGTGCTGA AGGGTCTCCA GCCCGAGCGA
ATGATCTATA ATCCCGGCAC CGAAGAAAAC GGGACAATCC GCGTCAAGCA GAATTTCGAA
CTGACGGATA TTGATTTCTG A
 
Protein sequence
MSFRHLVLSC LLVLPILLQN CAQVAQPPGG KKDTLAPKLV SSLPTPRQLN YTGKTVELEF 
DEYVNSENLQ QKITITPQDS NTFIVKSLPL GIRLSFNKPF LPNTTYTIDF ADGIKDITER
NIAKNTKVVF STGPVIDSLY LTGNVVDDES REPLLGFVVG LFASTDTLPI NRKRPQYFAR
TDSNGNYRIE NVKAGLYRVY GFDDKDLNLV NNTPGERIAF RDSVLNLNRN YTGIDLVAFR
GYGKPRISRR ERTDETLGLE LSSGIASYKL RYGRPVSTSA VSTSAVSTTA VSTTAVSTTA
VSTTAASTTA STGSYSGDTL ISFLETPKMI RLFRPANRAA DDTLHLTVVA EDSVGNVTEL
RERIYFSPLK TKAKNRSPLT VQVSPPSNEP IDNNLEFTLV FSKPVFSYKT ERIIIGPDST
KPFVLTPANL TWTNNKSRLV IRQKTNLTDT LLFRLQKGAF ISVQGDTLAR YSARYTIAEE
DSYGLIAGHV NPATTGAAGN KFIVELLDDK YTVVRSAYGT PSYSFGRLKP GLYRVRLIID
ANGNRKRDIG NVLKGLQPER MIYNPGTEEN GTIRVKQNFE LTDIDF
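
The gene and protein sequences above are related through translation table 11. As a rough check, the sketch below translates the first 30 bases and the last 21 bases of the gene sequence and compares them with the two ends of the protein sequence. It assumes the sense-codon assignments of table 11 match the standard genetic code (the two tables differ in their permitted start codons). The `translate` helper and the other names are hypothetical and do not come from the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in TCAG codon order; "*" marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence, copied from the record.
first_block = "ATGTCGTTTC GTCACCTTGT CCTGAGTTGT"
last_block = "CTGACGGATA TTGATTTCTG A"

print(translate(first_block))  # MSFRHLVLSC
print(translate(last_block))   # LTDIDF
```

Because 1740 bases precede the last block and 1740 is a multiple of three, the last block is in the same reading frame as the start of the gene.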