Gene Slin_1388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1388 
Symbol 
ID8725122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1687094 
End bp1689373 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content51% 
IMG OID 
Productglycosyl transferase family 51 
Protein accessionYP_003386237 
Protein GI284036307 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.319404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.452017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTATC GTCAACGTAA AGCCCTCCGG ATTGCAGGTT GGGTATTTTT AAGTATATTT 
TTGCTGGCCC TGGTCGGCGC TGGCATTGCG TACTCCAAAC GCGAAGCCTT ATTAAAAACG
GCCCTCGAAC GAGGGATTCG AAAGGCAAAG CGTGATTATA ACCTCGACGT TAAGATTGGT
TCTGCAAAAT TCACCGGATT AAGTTCGCTG GCGTTTACTG ATATTTCTGT CGTTCCTGAA
GCGCGCGACA GTTTGGCTCG TATCCAGCGC GTCGAACTGG CTGTTCGCTT TTGGCCACTG
CTGGCTGGCA AAATCGGCTT ATCGGGCATG ACCCTCGAAA ATGGCCTCAT TCAGGTCGTA
AAGCGTGATT CGCTCACTAA TATAGATTTC CTGCTGCATA AAAAACGGGA TTCGACGGCT
ACCGACTCCG CACCCGACGA AACAACCCGA CGCACGAATC TGGCCGATGT ATCAGAGAAT
TTAATTGATA ACATACTCTC TAAGATACCC GATGACCTTC ATGTCAATAA CCTGGAATTT
CGGGGTGTCG ACGTCGGTAG CGATACACTT AGCCTGTTAA CACAAACGGC GCTCATTGAG
AACGAAGCCG TTTCGTCAAC GATCAAGTTG AATGGGAATC AGGCAACCTG GCATTTGGCG
GGTACTGCTG ACCCCGGCGA CCGAGAATAT AACCTGGCTT TATATGCCGA AGGGCAAAAT
GGTCGCTCCG TACCCATTGA GCTACCTTAC ATTCAAAAAA AGTATAACCT CAAGGTTCAG
GCTGATACGC TGCGGGCTGA ACTCCGCGAT GTAGACCGTT CGCGGGGAGA GTTTCGCCTT
GAAGGAGCCG GATCGGTTCG AAATTTGCGC ATCAATCACC CATCCATTGC CCGGACGGAT
GTACTCATCA GCCGGGCCGC TATAGATGCC AATCTGTTTG TGGGTGAGAA CTACGTGGGT
GTTGATAGCT CATCAACGCT GCATCTGGGT GAAGTCAGTG CGCATCCGTT CCTGAAATAC
ACCTTGCCCG ATGTTGCATC TGCCGAGAGT CTGGCACCGG GTAAGATTTT TGACGTACAG
CTGAACACGG GTCCGATGGA TGCCCAGGCC CTGTTCAATT CGTTTCCGCA GGGTTTGTTC
GAATCGCTGG AGGGTATGCA GGTGGACGGC AAGCTTAACT ACAACCTGGC CTTTCATTTC
GATACCGCCT TACCCGATTC GGTTAAGTTT AACTCTAGCC TGACGCAGGA CAATTTCCGA
ATCCTTAAGA TGGGGCAGAC CGATTTCAGC GCGATCAACC GCCCCTTCGT CTACACGCCC
TACGAGAAAG GCAAACCCGT TCGGGACATT ATCGTTGGCC CTGCCAATCC GGACTATACC
CCGCTGAACC AGATTTCGCC CGACCTACGG AACGCGCTGT TAACGTCGGA AGATTACAAC
TTCTTCACCC ACAACGGGTT CAACGAGAAA GCCTTCCGGG TTTCCATTGC GACTAATTTT
AAGGAAAAAT CGTTTAAGCG GGGAGCCAGC ACCATCTCCA TGCAGTTGGT GAAGAATGCT
TTCCTCAACC GCAATAAAAC CATTTCCCGT AAGATTGAAG AAATCCTGAT CGTCTGGCTC
ATCGAAAACG AACACATCGT TTCGAAAGAA CGGATGTATG AGGTCTACCT GAACATTATT
GAGTGGGGCA AAAATATCTA CGGCATCAGC GAAGCCGCCC GCTATTACTT CGCCAAAAGC
CCTTCGGAGC TTGATTTAGG CGAAAGTATC TTCCTGGCGT TTGTGGTACC TAGACCGAAA
GCAGCACTGA GCTGGTTTGT GCCCGATGGT ACGTTGCAGG TCCGTAACGT ACGGGGCTAC
TTCAGGCTCA TTGGTCGTAT TATGGCCAAA CGAGGTCTTA CTGCACCCGA CTCCGGCGCC
TATGGTTTTT ACAACGTTCG ACTCCGCGAA GGGCTGCGTC GGCAGGTGGC TCCCGTTGAC
TCCTTATTCC AGGGCGACAG CTTAATGACC GACCCGGCCG ATGTGGACAT TAATGAGGAA
GACGGCGGAG CCGGAGGCAT CGGCAATTTC TTCCGTCGAC TGTTTAAAGG AAAGAAAGGC
GACGACACAC GCAGTAACGA CGAACAGCCG ACGGTTCAAC CTGACCGGAC GGAAGTACCT
TCGACTGAAA CAGCCCCGGC AGATACGGTG AAAACCCGTA AACAATTGCG GCAGGAACGA
CGCGAACGAA AACGCAGGGA AAAGGAAGCG CAGCAGGCAC TGGAAAACAA CAATCCGTAA
 
Protein sequence
MTYRQRKALR IAGWVFLSIF LLALVGAGIA YSKREALLKT ALERGIRKAK RDYNLDVKIG 
SAKFTGLSSL AFTDISVVPE ARDSLARIQR VELAVRFWPL LAGKIGLSGM TLENGLIQVV
KRDSLTNIDF LLHKKRDSTA TDSAPDETTR RTNLADVSEN LIDNILSKIP DDLHVNNLEF
RGVDVGSDTL SLLTQTALIE NEAVSSTIKL NGNQATWHLA GTADPGDREY NLALYAEGQN
GRSVPIELPY IQKKYNLKVQ ADTLRAELRD VDRSRGEFRL EGAGSVRNLR INHPSIARTD
VLISRAAIDA NLFVGENYVG VDSSSTLHLG EVSAHPFLKY TLPDVASAES LAPGKIFDVQ
LNTGPMDAQA LFNSFPQGLF ESLEGMQVDG KLNYNLAFHF DTALPDSVKF NSSLTQDNFR
ILKMGQTDFS AINRPFVYTP YEKGKPVRDI IVGPANPDYT PLNQISPDLR NALLTSEDYN
FFTHNGFNEK AFRVSIATNF KEKSFKRGAS TISMQLVKNA FLNRNKTISR KIEEILIVWL
IENEHIVSKE RMYEVYLNII EWGKNIYGIS EAARYYFAKS PSELDLGESI FLAFVVPRPK
AALSWFVPDG TLQVRNVRGY FRLIGRIMAK RGLTAPDSGA YGFYNVRLRE GLRRQVAPVD
SLFQGDSLMT DPADVDINEE DGGAGGIGNF FRRLFKGKKG DDTRSNDEQP TVQPDRTEVP
STETAPADTV KTRKQLRQER RERKRREKEA QQALENNNP