Gene Slin_3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3074 
Symbol 
ID8726826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3723527 
End bp3726091 
Gene Length2565 bp 
Protein Length854 aa 
Translation table11 
GC content52% 
IMG OID 
Productglycosyl transferase family 51 
Protein accessionYP_003387884 
Protein GI284037954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.846458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAT ATAGGGAGCG ATTCCGCGCT GTCAGGCACG CTTTTGGCGC ATTTCGGCAA 
CGTCAAAAAT TAACTGGCAA TGTCAGACGC AGAGTCGGCC GCCATATCGC ACGAGTTGCG
GGCGAAGAGC GCGTCAATGC ATGGGTCAAT ACGTATGATA CGTACCGCAA CCAGTTTCGT
TCATTTGTTC ATCGGTATAT CGACCCCGAT TCCTGGTATT ACCCCTACGT CAAAAATACC
GTTAAAGGGC TACTCGTGGC ATTGCTGGCC CTGGGTATGT ACGTATTCGT ACTAAACTAC
AACTTTCTCT ATCTGACGGG TGCCATGCCC AGCGTTGAGG AGTTGAAGAA CCCTAAACTG
AATCAATCGT CAGAAATATA CTCTCAGGAT GGCGTAATGA TCGGCAAGTT CTACGCCGAG
AACCGAACGC CTATCAAATA CGAGAACATA CCCAAACAGT TGATCAATGC GCTGGTAGCT
ACGGAAGATG CCCGTTTCTA CGACCACGGT GGCGTTGACC CCCGCGCCAT TGGCCGGGCT
GTGATTAGTT TCGGCCGCGA AGGGGGTGGC TCAACCATTA CCCAGCAATT AGCGAAAAAC
CTCTTTAAAA CCCGCCGAAA AACCAATACG GGCCTGCTGA CCCGCATTCC GTTCATTCGG
AAGGTCATCT ATAAATCGAA AGAGTGGCTG ATGGCCCTGA AGCTGGAGCG TAATTTTTCG
AAAGAAGAAA TTATCACTTA CTATTTCAAT ACCGTCGACT TTGGCAGCAA TGCCTTTGGG
TTAAAAACAG CGGCCCGTAC CTTTTTCAAC AAAGCCCCGG ATAGCCTGAA CGTACAGGAA
GGAGCCGTAC TGGTTGGTCT GCAAAAAGCT ACGACCAATT ACAACCCGCT CAAAAATCCC
AAGCGCTCCC GCGAGCGCCG GAATGTGGTC CTGGCGCAGA TGGCCAAATA TAATTTCCTG
ACCAAGTCTC AGGCCGACTC CATCAGTGCC CTACCGCTCG AAACGGAATT TACGCCGGAA
AACCCCTACT CGGGACCAGC CAGCTACCTC AAAAATGCCG TTCAGGATTA TGTAAAAAAA
TGGGGTGAAG AAAATGGCTA CGACCTGTAT ACGGATGGGC TTCGCATTAT TACGACCATT
GACTCCCGGA TGCAGACTTA TGCCGAAACG GCCACCAGTG AGAAAATGAA GCAGCTTCAG
CGCACCTTCG ACAACCACTG GCAGGGCCGG AATCCGTGGA CCGACGAGAA AGGAGCTGAG
CTGCCCGGCT TCATTGACTC CGTGGCCCGG CGTACGGAAC GATACAAATC CCTGAGCCGA
CGGTTCATGC CATTATACCC TGACTCGATC ATGTACTATA TGAAAAATGT GAAGTACAAG
ATGAGGGTGT TCAGTTGGAC AAGCAAGCGC GGGTATGATT CCGTCGAAAT GACGCCTTAC
GACTCCATTG CCTATTACAA GCACTTTCTG CAGGCGGGTA TGGTAGCGAT AGACCCTCGT
ACGGGCTACA TCCGGGCCTG GGTGGGTGGC CTGGATTACG ACTACTTCAA ATATGACCAC
GTAAAACAAG GCAAACGGCA GCCGGGGTCC ACCTTCAAAC CCTTTGTGTA TACGACTGCC
ATTGATGACA GCCTTATCAA CCTTAGCCCC TGCGACCGTA TTCAGGACAA ACCTTTCCGG
AAGGAGTATC GGGAGAATGG CGAGGACAAA ATCTGGGAAC CCCGCAATTC GACCGGTTAT
TACTCGTATT CGAATATGAC CCTCCGCCGG GCCATGGCCC GTTCGGTCAA CTCCATCACG
GCGCAGCTAA CCGACCGCGT TACTCCCGAA CGGGTAGCCC AATATGCACA CCGGATGGGA
ATAAAGAGTA GACTCGAAGC CGTTCCCTCC ATTGGTCTGG GCTCGTCGGA TGTGTCGCTC
TACGAACTGG TGGGTGCTTA TTGTACGTTT GTGAACGATG GCGAATCTAC CGAGCCAATC
ATTGTGCAGC GCATCGAAGA CCGGGATGGT AACGTGATCG AAACATTCAC AAGTCAGCAT
AAACGAGCCA TAAGTCCGGA AACGGCCTTT TTGATGCGCT ATATGCTCCA GGGCGGTTTG
CAGGAACCGG GCGGCACATC GCAAAACCTA TGGTCGTTCG ACCTCTTTAA GAATCATAAT
GAAATGGGTG GTAAAACCGG CACGACCTCC AACAACTCCG ACGGCTGGTT TGTGGGCGTA
TCCAACAATC TGGTTGTAGG GGCCTGGGTA GGTGGTGACG ACCGCAGTAT CCACTTCCGG
TCTACCGACC TCGGCGAAGG AGCAAAAACC GCCTTGCCGC TGGTGGGTAG TTTTCTGGAG
AAAGTATACC GGGACCCAAA ATTCAAAAAC CTCCAAGGCC CTTTCCCGAA GGCCGTCGGC
ATCACAAAAG AGTACTTAAA CTGCGGCTAC TCCGGCGACG AAGAAACGTC CGATGAATCA
GATTCGACCG ATGTATCGGA TATCGCCACC GACTCCACCT TATCGCCAAC AACCCCTGCT
CCGGACCCTG TAACGCCCCC AGATACGACA AAAAGCGGGC AATAA
 
Protein sequence
MSQYRERFRA VRHAFGAFRQ RQKLTGNVRR RVGRHIARVA GEERVNAWVN TYDTYRNQFR 
SFVHRYIDPD SWYYPYVKNT VKGLLVALLA LGMYVFVLNY NFLYLTGAMP SVEELKNPKL
NQSSEIYSQD GVMIGKFYAE NRTPIKYENI PKQLINALVA TEDARFYDHG GVDPRAIGRA
VISFGREGGG STITQQLAKN LFKTRRKTNT GLLTRIPFIR KVIYKSKEWL MALKLERNFS
KEEIITYYFN TVDFGSNAFG LKTAARTFFN KAPDSLNVQE GAVLVGLQKA TTNYNPLKNP
KRSRERRNVV LAQMAKYNFL TKSQADSISA LPLETEFTPE NPYSGPASYL KNAVQDYVKK
WGEENGYDLY TDGLRIITTI DSRMQTYAET ATSEKMKQLQ RTFDNHWQGR NPWTDEKGAE
LPGFIDSVAR RTERYKSLSR RFMPLYPDSI MYYMKNVKYK MRVFSWTSKR GYDSVEMTPY
DSIAYYKHFL QAGMVAIDPR TGYIRAWVGG LDYDYFKYDH VKQGKRQPGS TFKPFVYTTA
IDDSLINLSP CDRIQDKPFR KEYRENGEDK IWEPRNSTGY YSYSNMTLRR AMARSVNSIT
AQLTDRVTPE RVAQYAHRMG IKSRLEAVPS IGLGSSDVSL YELVGAYCTF VNDGESTEPI
IVQRIEDRDG NVIETFTSQH KRAISPETAF LMRYMLQGGL QEPGGTSQNL WSFDLFKNHN
EMGGKTGTTS NNSDGWFVGV SNNLVVGAWV GGDDRSIHFR STDLGEGAKT ALPLVGSFLE
KVYRDPKFKN LQGPFPKAVG ITKEYLNCGY SGDEETSDES DSTDVSDIAT DSTLSPTTPA
PDPVTPPDTT KSGQ