Gene Slin_1572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1572 
Symbol 
ID8725306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1897440 
End bp1900676 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content53% 
IMG OID 
Productglycosyl hydrolase, BNR repeat-containing protein 
Protein accessionYP_003386420 
Protein GI284036490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0452612 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAATA GAAAATTTAC ATTTAGTATA ATCGCTTTGG CAATTGTCTG TTCATCTGTC 
GATCTAGCCT TTGCCCAGCG GAAAAGTAAA CCAGTTGCTA AATTAACATC GGCACAATCG
ACTGACAGCC TCATCAACGG CTTGCGGTGG CGTAACATCG GCCCCTTTCG CGGTGGGCGG
TCACTGGCTG TTTCCGGGCA TTCGAGCCAG CCGCTTACGT ATTATTTTGG TGCTACGGGC
GGTGGCGTCT GGAAAACGGT CGATGGCGGG GCCAACTGGT TCATGGTCTC GGATAGTACC
TTTAAATCCA GCTCAGTAGG TGCAGTTGCC GTGGCCCCTT CCGATCCGAA CACGATTTAC
GTGGGCATGG GCGAAGCCGA TATCCGCAGC AACATTGCCA ATGGCGATGG AGTCTATAAA
TCGACCGACG CGGGTAAAAC CTGGAAACAT ATTGGTCTGA AAATGGCCGA TGCGGTCGCT
AGTATTGAGG TTCATCCGAC CAATCCCGAT GTGGCGTATG TGGCCGCTTT GGGGAATCCG
TTTGCGCCTA ACAAAGAGCG GGGCCTGTTT CGTACCACCG ATGGCGGTAA AAGCTGGAAG
GCCATTCTGA CAAAAAACGA TAGTACCGGT GCTATTGTCG TGAAACTCGA TCCTAACAAC
CCGTCGATCG TTTACGCGTC GATGTGGCAG GCGTACCGCA ATAGCTATAT GATGAGCAGT
GGCGGTCCCG GTTGCGGTTT GTATAAATCG ACCGACGGGG GCGACACCTG GACTAACCTG
AGCACCAAAC CCGGTATGCC CAAAGGGCTG TTGGGCAAAA TTGGCATTAC CGTTTCGCCC
GCCAACTCGA ACCGGCTTTA TGCCATGGTC GAGAACGCTA AGGGTGGTTT ATACCGCTCC
GACGATGCCG GTGAATCTTG GCAGTTGATC AACGAAGACA AGAATCTCTG GCAGCGGCCC
TGGTATTATA TGATGCTAGC CGCCGATCCC CAAGACGAAA ACGGCTTGAT CGTACTGAAC
GTGAATGCCC TGAAATCGTA TGATGGCGGT AAGACATTTT CGACCATTGG CGTACACCAC
GGCGATACGC ATGACATTTG GTGGAACCCC AAAAATCCGC AGAACTTCAT CATTGCCGAC
GATGGTGGTG CCGAAGTGAC CTATAATGGT GGGGCAACCT TTTCCGATAT TGATATTCCA
ACCTCCCAGT TTTACCACGT AGCCGTTGAT AATGATTTTC CCTACAACCT CTACGGAGCC
CAGCAGGATA ACTCTTCTAT CCGGATTGCC AGCCGCACTA CCGAGTATTC CATTGGCAAA
TCGGCCTGGT ACCCGGTATC GGGTGGTGAG TCTGGTTATA TCACACCCGA CCCGAACAAT
CCGAACGTGA CCTACGGCGG TAGCTATGAT GGCCTGATTA CCCGCTACGA TAAAGCGACG
GATCAGAATC AGGTTATCAA TGTGTATCCA GAGTACTTCA TGGGAGCCCC TTCATCAGCG
CGGAAGTACC GTTTTCAGTG GACGTACCCC ATCGTTTTCT CGCCCCACGA CAGTAAAACG
CTGTACATCA CCTCGCAGTA CCTGCACCGG AGTACGGACA ATGGCCATTC GTGGCAGGAC
ATCAGCCCCG ATCTAACCCG TAATGACCCC AAAACGCAGG GCGATACGGG CGGCCCGATC
ACCAAAGACA ATACGGGGGC CGAAACCTTC CCCACGATCT TCACCTTTGC CGAATCGCCG
GTAGAAAAAA ACATCTTCTG GGCCGGTTCC GACGATGGGC TGATGCACAT TAGTCGGGAC
GGCGGCAAAA ACTGGCAGAA TATCACCCCC CCGGTGTCCA TGCTGCCCGA AATGGCAATC
ATGAGCATGG TTCACCCGTC TGACCATACG GCGGGTAAGG CCTATCTGGC GGCCAAGCGG
TACATGTCCG GCGACCGTAA ACCCTATATG TTCAAAACGA CCGATTATGG CAAAACCTGG
ACGAGCATCA CGGCGGGTAT TCCATCGGAC GAGCATTGCC ATGTAGTGCG CGAAGACCCG
AATAAACCTG GTCTGTTATA CGCTGGAACC GAGCGTGGGG TGTATGTATC CTTCAACGAT
GGCGGTTCGT GGGAGAAGTT AAGCATGAAC CTGCCCGTTA CGCCTGTGCG TGATTTGCAA
ATTCAGAAGC GCGAAAAAGA TCTGGTCATT GCCACTCACG GGCTGGCATT CTGGATCATG
GACGACATTA CACCGCTGCA TGAACTGATG GATGTGAAAC AGGGCGGTAA ATCGGTGGCT
CACATGCATT TGTTCAAACC TCGCCATGCC TATCGCATGG AAGGCGGTGC CGGAAACAGC
CGTCGGGGTC GTGCTGCGCC ATCTGACGAA GGCGAAAATG CTCAAAATGG CGTCATTACG
CGTTATTATC TGAAAAGCAA GCCGACCAAA GAACTACGGC TTATCTACAT GACAACCGCT
GGTGATACCA TCAGTGCCTA CTCCAGCACG AAGGATAAAA AAGGGCAACC GCTCAAAATT
GCGAAGGAGT TTTATCAGGA CGAAACCGCA ACCCGCCCCG GCTCCATTCC CGCCAGTCCG
GGAATGAACG TGTTCGTTTG GGATATGCGT TACCCCGATG CCACCGCCGT TGACGGCATC
AACGTGATGT GGACGGGTCG CGGAACGGGT GCTAAAGTCA TTCCAGTAAT GTATAAAGTA
CGGATGTTGC TGGGCGATTC GCTGATCTCG GAGCAACCCA TAGAAATCCG GAAAGACCCT
CGCTTAGCGA TAACAGCCGC CGAGTACCAG GAACAGTTTG ACCTGTTGCA GAAAATCAAC
GGAAAACTGT CCGAAACGCA CAAAGCGATC AACCAGCTTC GGCAGATTCG CACCCAGATC
AACGGGTATA CGAGCGGGGT AAAGGAACCG AAAGTAGCCG AGAAATTTAA AAACGCGGCT
AAGCCCATTC TGGACGAACT GGACAAGATC GAATCGACGC TGATGCAGCC GAAGTCGAAA
GCACCGCAGG ACGCGCTGGC GTATCCAATC CGATTGAACG ACAAAATTGC AGGTGTGGCC
TCGGTGGTTT CCTCGGCCGA TACCAAGCCA ACCAAGTCGT CTTACACCGC ATACGATGAC
CTGTCCAAAC AAATTGACAC GGCACTGACC AAGCTGAAAG AAGTCATCAA CACCCAGGTG
CCGGGCTTCA ACAAAATGGT AACCGAACAG CAGGTACCCG CTATTATTTT GAATTGA
 
Protein sequence
MVNRKFTFSI IALAIVCSSV DLAFAQRKSK PVAKLTSAQS TDSLINGLRW RNIGPFRGGR 
SLAVSGHSSQ PLTYYFGATG GGVWKTVDGG ANWFMVSDST FKSSSVGAVA VAPSDPNTIY
VGMGEADIRS NIANGDGVYK STDAGKTWKH IGLKMADAVA SIEVHPTNPD VAYVAALGNP
FAPNKERGLF RTTDGGKSWK AILTKNDSTG AIVVKLDPNN PSIVYASMWQ AYRNSYMMSS
GGPGCGLYKS TDGGDTWTNL STKPGMPKGL LGKIGITVSP ANSNRLYAMV ENAKGGLYRS
DDAGESWQLI NEDKNLWQRP WYYMMLAADP QDENGLIVLN VNALKSYDGG KTFSTIGVHH
GDTHDIWWNP KNPQNFIIAD DGGAEVTYNG GATFSDIDIP TSQFYHVAVD NDFPYNLYGA
QQDNSSIRIA SRTTEYSIGK SAWYPVSGGE SGYITPDPNN PNVTYGGSYD GLITRYDKAT
DQNQVINVYP EYFMGAPSSA RKYRFQWTYP IVFSPHDSKT LYITSQYLHR STDNGHSWQD
ISPDLTRNDP KTQGDTGGPI TKDNTGAETF PTIFTFAESP VEKNIFWAGS DDGLMHISRD
GGKNWQNITP PVSMLPEMAI MSMVHPSDHT AGKAYLAAKR YMSGDRKPYM FKTTDYGKTW
TSITAGIPSD EHCHVVREDP NKPGLLYAGT ERGVYVSFND GGSWEKLSMN LPVTPVRDLQ
IQKREKDLVI ATHGLAFWIM DDITPLHELM DVKQGGKSVA HMHLFKPRHA YRMEGGAGNS
RRGRAAPSDE GENAQNGVIT RYYLKSKPTK ELRLIYMTTA GDTISAYSST KDKKGQPLKI
AKEFYQDETA TRPGSIPASP GMNVFVWDMR YPDATAVDGI NVMWTGRGTG AKVIPVMYKV
RMLLGDSLIS EQPIEIRKDP RLAITAAEYQ EQFDLLQKIN GKLSETHKAI NQLRQIRTQI
NGYTSGVKEP KVAEKFKNAA KPILDELDKI ESTLMQPKSK APQDALAYPI RLNDKIAGVA
SVVSSADTKP TKSSYTAYDD LSKQIDTALT KLKEVINTQV PGFNKMVTEQ QVPAIILN