Gene Slin_5521 details

Gene Information

Locus tag: Slin_5521
Symbol:
ID: 8729289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6721496
End bp: 6723319
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: Tetratricopeptide repeat protein
Protein accession: YP_003390286
Protein GI: 284040356
COG category:
COG ID:
TIGRFAM ID:
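
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6721496
END_BP = 6723319
GENE_LENGTH_BP = 1824
PROTEIN_LENGTH_AA = 607


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1824
print(expected_protein_length(GENE_LENGTH_BP))  # 607
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.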


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0375374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.605341
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAATAA GGTCAAAAGT CATTATTGGG CTGTTATGGG TTTTGTCTGC CGGACTTACC 
TGGGCGCAAG TTGGCCCCAC ATCGGGCGGA GCGACGCTGG CCGAAGAGTA TTATAAAGCT
GGGGACTTCG CAAAGGCTGC GAATGAATAT GCGAAGTTGT TGAAGACTGA TGTGAACTGG
ACGAAACTTT CCAGGTATGT GTATAGCCTC CAAAAAAGTA ATAAGACCGA AGACGCAATA
AAATTTTTAC GCAAGCAGCA GCGTAGCGAC GAAGCCAACC GATCCTACTA CGAGTTGTTG
AGTGGTCAGT TGGCCACGCA ACAGGGCGAT ACCACACAGG CAAAAGCTCA ATTTACGGCC
GCCGTACAAT CGAGCCGATC GTCAATCGCC AAGCTCGAAA AACTTGCAGC TGCCTTCAGC
GAGGTAGGAG AGAGCCGCTG GGCTATCCGA ACACTCGAAA CTGCCCGGGA AGTCAGTAAA
GAGCAAACGG CCTACAGTGA GGAGTTAATG ACGCTGTATC GGTCGACAGG GCAAACCGAA
AAAGCCATCG ACGAAATTAT TCTGACTGGA AAGCAACCTG AAAAGAAAGA GACCGTTTTA
GCCGCTTTGC AAGGGTTTAT CAACACCAAA GATGAACCCC TTGTGGAAAA AGCACTTTAT
TCCAAAATTC AGCAGGAACC GAACGAGTTG TCGTACAATG AACTACTGAT TTGGTATTTT
GTCCAGAAGC AAAAATTTAG CCGTGCCCTG CTACAGGAAA AAGCAACCGA CAAACGGCTG
AAATTAAACG GGAGCCGGGT GTATGATTTA GGGATGCTGG CCATGAACAA TAAAGAATAC
AAAACAGCAG CTGAGTCATT CGAGTATGTA TCCACAACAT ATCCACAAGG ACAACTCTAC
CCGTTTGCGC GCCGATTGGT CATTAGTGCG CGGGAAGAAC AGGTAAAGAA TACTTACCCG
GTTGATAAGT TAGAGATTAG AAAGCTTATT AGCGATTATC AAAAAATGCT TCAGGAGATC
GGTACGAATA GCAAAACACT CGAAGCATTG CGTAGTACTG CCAATCTATA TGGCAATTAT
CTGGATAGTA AGGATACCGC ATTGACTGTA CTTGATCTGG CTATCGATTT AGGTAAAACC
GATAAAAATT TTGTTGATCG CTGCAAGCTG GATAAGGGCG ACATCTACCT CCTGAAGGGA
GAGCCCTGGG AATCAACATT GCTGTATTCA CAGGTTGAGA AGTCACAAAA GGAGGAATTA
CTAGGCTACG AAGCAAAGCT CAAAAATGCA AAGCTCCAAT ATTACCGGGG TAATATGGCT
GTCGCGAAGG ATCTACTGGA TGTGCTTAAA CTGGCCACTT CCCGCGAAAT TGCTAATGAT
GCCGAGCAGT TAAGTTTATT GATTGTGGAT AACACGGGCT TAGACAGTAC AGAAGCTGCC
ATGCGACACT ATGCAGATAT CGATTTGATG CTATTCCAGA ACAAAACAGA AGAAGCCGTT
CTAGAACTAA ATAAGATGTT GAAAACCTAC CCAGAACATA GCCTCGTGGA TGAGATTCTA
TGGCTGCGAG CAAATACCTT TTTGAAACAG GGAAAGAATG CGGAAGCGTT AGAAGACCTC
AAAAAAATTG TTGCTTCTTA TCCGAACGAC ATACTTGGCG ATGATGCCCA ATTTATGCAA
GGGAAAATCT ATGAAGACCG GTTGAAAGAT AAACAAGCAG CAATGGAAGC TTACCAAAAA
GTCTTAACAC AATATCCGGG TAGTATCTAT GGAGCCGAAG CACGTAAACG CTTCCGTGCG
TTGAGAGGAG ATACCTTGAA TTAG
 
Protein sequence
MLIRSKVIIG LLWVLSAGLT WAQVGPTSGG ATLAEEYYKA GDFAKAANEY AKLLKTDVNW 
TKLSRYVYSL QKSNKTEDAI KFLRKQQRSD EANRSYYELL SGQLATQQGD TTQAKAQFTA
AVQSSRSSIA KLEKLAAAFS EVGESRWAIR TLETAREVSK EQTAYSEELM TLYRSTGQTE
KAIDEIILTG KQPEKKETVL AALQGFINTK DEPLVEKALY SKIQQEPNEL SYNELLIWYF
VQKQKFSRAL LQEKATDKRL KLNGSRVYDL GMLAMNNKEY KTAAESFEYV STTYPQGQLY
PFARRLVISA REEQVKNTYP VDKLEIRKLI SDYQKMLQEI GTNSKTLEAL RSTANLYGNY
LDSKDTALTV LDLAIDLGKT DKNFVDRCKL DKGDIYLLKG EPWESTLLYS QVEKSQKEEL
LGYEAKLKNA KLQYYRGNMA VAKDLLDVLK LATSREIAND AEQLSLLIVD NTGLDSTEAA
MRHYADIDLM LFQNKTEEAV LELNKMLKTY PEHSLVDEIL WLRANTFLKQ GKNAEALEDL
KKIVASYPND ILGDDAQFMQ GKIYEDRLKD KQAAMEAYQK VLTQYPGSIY GAEARKRFRA
LRGDTLN
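
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the pipeline that produced the record: it uses the standard codon assignments, which translation table 11 shares for non-initiator codons, it strips the spaces and line breaks used to group the sequence into blocks of ten, and it stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the compact NCBI layout: bases ordered T, C, A, G
# for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Assumes only A, C, G, T are present; any other symbol raises ValueError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGTTAATAA GGTCAAAAGT CATTATTGGG"))  # MLIRSKVIIG
print(translate("TTGAGAGGAG ATACCTTGAA TTAG"))        # LRGDTLN
```

The two fragments reproduce the first ten and last seven residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.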