Gene Slin_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1688 
Symbol 
ID8725425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2023028 
End bp2024458 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content53% 
IMG OID 
ProductRNA methyltransferase, TrmA family 
Protein accessionYP_003386533 
Protein GI284036603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.618499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGGA AGATTCATAA AACACCCGAA CGGTTGGCAC ATGTTCGTAT CGATGCCGTA 
GCCGCCGAAG GGAAGTGTAT TGTACGTACT GACGAAGGCG TAATTTTTGT TGAAAACCCC
ACCGGTGGAC CGGGTGTGGC CCCCGGCGAT GTGGTAGACC TGCGCATTAC CAATACCAAA
AAGCAGTACC GCGAAGCCGT TGCCGAACGC GTTCATGAAT GGTCTGCGGT CCGTACTGAG
CCTTTTTGTG AGCATTTTGG AACCTGTGGC GGCTGCAAGT GGCAGCATAT TCAATACAGC
GAACAACTTG GCTTTAAACA CCAGCAGGTG GTCGATCATC TGACACGTAT TGGTAAGGTG
GAACTGCCCG AGTTCCGGCC CATTATGCCT GCTCATCCAA CCCAGTACTA TCGGAACAAG
CTTGAATTTA CCTGCGCCGA AGGCCGCTGG CTGACCTCGG CCGAAGTAGG TACTAATCAA
CCGATGGACC AGCGGGCGGT AGGCTTCCAC GTGCCCGGCC GGTTCGATAA AGTGCTGCCC
ATTCGGCATT GTTACCTCCA GCCCGACCCG TCCAACGCTA TCCGCGAAGC CATTGACGCT
TATGTGCTTC AGCATGACAT GACGTTGTAT AACCTGAAGA TGCATACCGG TTTTCTGCGA
ACACTCATCA TCCGCACTGC CGATACTACG CAGCAGGTAA TGGTAACGTT GCAGGTAGCC
CAGGACAACC CCGAATTGCT CAACGGGCTG ATGACGTATT TGCAGACGCT GTTCCCACAG
ATCACCTCCC TGAACTACAT TCTGAACACC AAAAAGAACG ATAGCTATCA GGATCAGGAG
GTGGTCAACT GGGCCGGAAA ACCCTACATC GAGGAGCAGA TGGAAGCGCT CACCTTCCGG
ATAGGCCCCA AATCGTTTTA CCAGACCAAT GCCCAGCAAG CGTATAACCT TTATAAAGTG
GCCCGCGAAT TTGCCGGTCT CACCGGTCAG GAACGCGTGT ACGATTTATA TACGGGTACG
GGCACCATTG CGCTTTTTGT GGCTCGTTTG GCCAAACATG TCGTTGGCGT GGAGTATGTC
GAAGCATCGG TAGCCGATGC CCGCGTTAAT GCACAAGTGA ACGGAATTGC CAACACGACC
TTTGTTGCGG GGGATATGAA GGCGATTCTG ACCGACGAAT TTTTTGCAGA ACACGGCCGT
CCGGATGTGG TCATCACCGA TCCGCCCCGT GCTGGTATGG ATGAGGCCGT TACCAGGCAG
CTACTTAAAG CCGCTCCGGA ACGGATTGTT TACGTAAGTT GCAACACCGC TACCCAGGCC
CGCGATCTGG CGATTCTGGA CGAAGGATAT ACCGTAACGG GTGTGCAACC GGTCGATATG
TTCCCGCATA CGCACCATGT CGAAAATGTA GTTGTCCTGA CGAAGCGGTA G
 
Protein sequence
MRRKIHKTPE RLAHVRIDAV AAEGKCIVRT DEGVIFVENP TGGPGVAPGD VVDLRITNTK 
KQYREAVAER VHEWSAVRTE PFCEHFGTCG GCKWQHIQYS EQLGFKHQQV VDHLTRIGKV
ELPEFRPIMP AHPTQYYRNK LEFTCAEGRW LTSAEVGTNQ PMDQRAVGFH VPGRFDKVLP
IRHCYLQPDP SNAIREAIDA YVLQHDMTLY NLKMHTGFLR TLIIRTADTT QQVMVTLQVA
QDNPELLNGL MTYLQTLFPQ ITSLNYILNT KKNDSYQDQE VVNWAGKPYI EEQMEALTFR
IGPKSFYQTN AQQAYNLYKV AREFAGLTGQ ERVYDLYTGT GTIALFVARL AKHVVGVEYV
EASVADARVN AQVNGIANTT FVAGDMKAIL TDEFFAEHGR PDVVITDPPR AGMDEAVTRQ
LLKAAPERIV YVSCNTATQA RDLAILDEGY TVTGVQPVDM FPHTHHVENV VVLTKR