Gene Slin_1020 details

Gene Information

Locus tag: Slin_1020
Symbol:
ID: 8724750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1234963
End bp: 1238076
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: acriflavin resistance protein
Protein accession: YP_003385870
Protein GI: 284035940
COG category:
COG ID:
TIGRFAM ID:
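
The coordinate and length fields above are mutually consistent. A minimal Python sketch of the check follows; it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1234963
end_bp = 1238076

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3114
print(protein_length)  # 1037
```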


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000109455
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTCC CCGAGCTTAG TCTGAATCGA CCGGTTTTTG CCATGGTGAT GTCTATTGTC 
ATCGTCCTTT TTGGCATTAT CGGTTTCACC TTTCTGGGTG TTCGCGAATA CCCGGCTATC
GACCCGCCGG TTATTTCGGT ACGGACAAAC TATACCGGTG CCAACCCAGA TATCATTGAA
TCGCAGATTA CGGAACCCAT CGAGAAATCG TTGAACAGTA TCGAAGGGAT TCGTACAATC
TCGTCGAACA GCGCGCTCGG TGCCAGTACC ATTACAGTCG AGTTCAACCT CGATGCCGAT
TTGGAACAGG CCGCCAACGA TGTACGCGAT AAGGTAGCCC AGGCCCAACG GCAACTGCCG
CAGGATATCG ACGCCCCACC TGTGGTAACA AAAGCCGATG CCAACTCTGA TCCTATTATT
TTCATGACGG TTCAGAGTAC GACCCGCAAT CCAACCCAGT TGTCGGATTA CGCCGAAAAC
GTGCTTCAGG AACGTCTGCA AACGATTCCG GGCGTTAGTC AGGCCAACAT CTATGGCTTG
AAGCGTCAGG CTATGCGCCT TTGGATTGAC CCCATCAAAC TATCGGCCTA TCGGCTCACC
TCACAGGATA TTCAGACCGC ACTGAATGCC CAGAACGTTG AGTTACCCAG CGGTAAAGTG
TATGGTAATA CAACCGAGCT GACGGTGAAG GCGGTTGGTC GTCTGACAAC CGAAGATGAT
TTCAATAACC TCATCCTTCG CCAGACGAGC AATCAAATTG TTCGTTTCAA AGATGTCGGG
TATGCAACCA TCGGTGCGGA GAACGAAGAA ACTATCTCTA AACAGAATGG GGCAGTAGGG
GTTATTCTGG CGCTTATTCC TCAGCCAGGT GCCAACTATG TGAGCATTGC CGATGAGTTT
TATAAGCGCT TCGACCAACT CAAAAAAGAC CTGCCCGAGG ATATTATCGT AAGTATCGGC
GTTGACCGGA GTACATTTAT CCGACGCGCC ATTGAAGAAG TAGGCGAAAC ACTGCTTATT
TCGTTTGTAC TGGTCGTACT GGTTATCTAT TTCTTCTTCC GCGACTGGCT CATTGCTTTC
CGACCGCTGA TCGACATTCC GGTATCGCTT ATCGGGGCCT TCTTCATCAT GTATGTGGCC
GATTTCAGTA TTAACGTGCT GACCCTGCTC GGTATCGTTC TGGCAACCGG CCTTGTAGTA
GATGATGGTA TTGTCGTAAC GGAGAATATC TTCAAGAAGA TTGAGCAGGG CATGGACACC
AAGGAAGCTG CCCGCGAAGG TTCTAATGAG ATTTTCTTTG CCGTTATTGC AACCAGTGTT
ACACTGGCTA TCGTGTTCCT GCCCATTATA TTCCTGGAGG GTTTTGTGGG TCGTCTGTTC
CGCGAATTTG GTATCGTTGT CGCTGGTGCC GTATTAATCT CGGCCTTCGT TTCGCTAACC
CTGACCCCGG TGCTTAGTGT AAAGCTCACC AGTAAGAACC ACGGTCGGTC CTGGTTTTAC
CGAAAAACAG AGCCCTTTTT CGAATGGCTG GATAATTCTT ACCGGTCGTC ATTGAACAGC
TTCATGAAAA AACGGGGCTG GGCGTTTGTT ATGATTGGTG CCTGTCTGCT GTTTATCTTC
GGACTTGGCT CTATGCTCAA ATCGGAACTG GCCCCGCTCG AAGATCGTAG TCGGACCCGC
CTGGTGATTA CTTCGCCTGA AGGAACAAGC TATGAGGCTC AGGCATCTCT AACGGACAGG
GTCATGCAGT TTGTACTCGA CTCTATCCCC GAAACCAAAT TAGCCTTTAG CGTAGTAGCA
CCCGGTTTTT CGGGGGCAGG CGCGGTTAAC TCCTCCTTCG TGATGGAGAA CCTGGTAGAC
CCCAGCAACC GCAATCGGTC GCAGCAGGAT ATTGTCGATT ATATTAATAA AAATCTCAAG
AAGTTCAACG AAGCCCGCAT GTTCGCTACG CAGGACCAGA CCATTCAGGT TGGCCGGGGC
GGTGGATTGC CGGTGCAGTT TGTTATCCAG AACCTGAACT TCGAAAAACT CCGCGAGAAA
CTGCCAACGT TTCTGGACGA AGTAGCCAAA GACCCAACGT TCCAGAACTC CGACGTAGAC
CTGAAGTTTA ACAAACCGGA GCTGAACATT AGCATCGACC GCGAGAAAGC GACGAACCTG
GGTATTTCGG TGCAGGATGT TGCCCAAACG CTCCAGCTTG CGCTTAGTAA CCGGCGTCTG
GCTTACTTCC TGATGAACGG AAAGCAGTAT CAGGTAATTG GGCAGGTAGA CCGCGCCGAC
CGTGATGCCC CCGTCGATCT GGCCTCTTTC TATGTACGTT CCAACCAGGG GCAACTTATT
CAGTTAGACA ACCTGGTGAA ATTTCAGGAA GTGAGTAGCC CGCCCCAGGT ATACCACTAC
AACCGCTTTA AATCGGCGAC GGTATCGGCG GGTCTGGCAC CCGGCAAAAC GGTGGGCGAC
GGTGTAGAGG CCATGCGCGC TATTGCGGCT CGTACCCTCG ACGAAAGTTT CCAGACGGCC
CTTTCAGGTC CTTCCCGCGA CTATGCCGAG AGTTCGTCCA ACACCTTATT TGCCTTTGGT
CTGGCGTTAA TTCTGGTTTA TTTAGTTCTG GCGGCCCAGT TCGATTCGTT TATCGATCCG
CTCATTATCA TGATCACCGT GCCTCTGGCG CTCGCGGGTG CCGTATTCTC ACTCTGGATG
TTTAACCAAA CGCTGAATAT CTTCAGCCAG ATCGGGATTA TTATGCTGGT TGGTCTGGTT
ACGAAAAACG GAATCCTGAT TGTTGAATTC GCCAATGAAC AGCGACTGAC GGGTAAGAAC
AAGTTCGAAG CAGCAGCAGA ATCGGCTGCG TTGCGGCTTC GTCCTATTCT AATGACCACG
CTTGTAGCGG CCTTTGGTGC TTTGCCACTG GCCCTTGCCC TGGGTTCGGC TTCAAAGAGC
CGGGTACCGC TGGGTATCGT TATCGTGGGA GGACTGATGT TCTCGCTCAT TCTAACCCTG
TACGTCGTTC CGGTCATTTA CACGTACATG TCCCGACGGA AAGATGTCCA GCCTGAAGTT
GATTCAAAAT CGGAAGACAA GGAAAAACCA ACAAAGCTGG AAGTACATGC TTAA
 
Protein sequence
MSLPELSLNR PVFAMVMSIV IVLFGIIGFT FLGVREYPAI DPPVISVRTN YTGANPDIIE 
SQITEPIEKS LNSIEGIRTI SSNSALGAST ITVEFNLDAD LEQAANDVRD KVAQAQRQLP
QDIDAPPVVT KADANSDPII FMTVQSTTRN PTQLSDYAEN VLQERLQTIP GVSQANIYGL
KRQAMRLWID PIKLSAYRLT SQDIQTALNA QNVELPSGKV YGNTTELTVK AVGRLTTEDD
FNNLILRQTS NQIVRFKDVG YATIGAENEE TISKQNGAVG VILALIPQPG ANYVSIADEF
YKRFDQLKKD LPEDIIVSIG VDRSTFIRRA IEEVGETLLI SFVLVVLVIY FFFRDWLIAF
RPLIDIPVSL IGAFFIMYVA DFSINVLTLL GIVLATGLVV DDGIVVTENI FKKIEQGMDT
KEAAREGSNE IFFAVIATSV TLAIVFLPII FLEGFVGRLF REFGIVVAGA VLISAFVSLT
LTPVLSVKLT SKNHGRSWFY RKTEPFFEWL DNSYRSSLNS FMKKRGWAFV MIGACLLFIF
GLGSMLKSEL APLEDRSRTR LVITSPEGTS YEAQASLTDR VMQFVLDSIP ETKLAFSVVA
PGFSGAGAVN SSFVMENLVD PSNRNRSQQD IVDYINKNLK KFNEARMFAT QDQTIQVGRG
GGLPVQFVIQ NLNFEKLREK LPTFLDEVAK DPTFQNSDVD LKFNKPELNI SIDREKATNL
GISVQDVAQT LQLALSNRRL AYFLMNGKQY QVIGQVDRAD RDAPVDLASF YVRSNQGQLI
QLDNLVKFQE VSSPPQVYHY NRFKSATVSA GLAPGKTVGD GVEAMRAIAA RTLDESFQTA
LSGPSRDYAE SSSNTLFAFG LALILVYLVL AAQFDSFIDP LIIMITVPLA LAGAVFSLWM
FNQTLNIFSQ IGIIMLVGLV TKNGILIVEF ANEQRLTGKN KFEAAAESAA LRLRPILMTT
LVAAFGALPL ALALGSASKS RVPLGIVIVG GLMFSLILTL YVVPVIYTYM SRRKDVQPEV
DSKSEDKEKP TKLEVHA
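
The sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before use. The following Python sketch shows one way to strip the spacing, compute GC content, and translate a DNA string. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table; the function names (`clean`, `gc_content`, `translate`) are hypothetical helpers and not part of any database API.

```python
# Codon order used by NCBI translation tables: bases in the order T, C, A, G.
BASES = "TCAG"
# Amino acids for table 11 in that codon order (same assignments as table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGTCTCC CCGAGCTTAG TCTGAATCGA CCGGTTTTTG CCATGGTGAT GTCTATTGTC"
print(translate(first_line))  # MSLPELSLNRPVFAMVMSIV
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to the same functions would be the natural next step.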