Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4522 |
Symbol | |
ID | 8728286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5482181 |
End bp | 5484508 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | 5- methyltetrahydropteroyltriglutamate/homocysteine S-methyltransferase |
Protein accession | YP_003389301 |
Protein GI | 284039371 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCAC ACAACCTCGG CTATCCACGA ATCGGTAGCC ACCGCGAACT CAAACGGGCC TGTGAACAAT TCTGGGCCAA AAAAATCACC CCTCAGCAGC TTCAGGAAGC AGGCCGTCAA CTACGGCAGG CTAACTGGCT GACCCAGCAG CAGGCTGGCA TTGCACTCGT CCCCTGCAAT GACTTTTCGT TCTACGATCA GGTGCTCGAC ATGTCGCTGA CGGTCGGAGC AATACCGAAA CGGTTCAAAC CCCTGCTCGA TGAACCAACG GCATATCCAC TGGCCCTCTA TTTCGCTATG GCCCGCGGTT ATCAGCAGGA TGGGCTGGAT CTGAAAGCGA TGGAAATGAC CAAGTGGTTC GATACCAACT ACCATTACAT CGTTCCTGAG TTTTCGGCAG ATCAGTCGTT TTCACTGCTC TCCGAAAAAA TCTTCACCGA ATTCGACGAA GCCAGAGAAA CGCTGGACAC GCTGCCTAAA CCAGTTTTGC TGGGTCCGGT ATCCTACCTT CTGCTTGGCA AGGAAAAAGA ACCGGGCTTC GACCGACTGA CGTTGCTTGA CCGGTTACTC CCGGTTTATA AGCAGATACT TGGCCGGTTA CAGCAGCAGG GCGCCGAGTG GGTTCAGTTC GATGAACCGT GCCTTTCACT GGATTTGACG CCCCATCAGC AGGAAGCCAT AGCCTCCGTT TACGGGCAGC TACGCCAGCA GTTTCCTAAT CTCAAACTAC TACTTGCGGC TTACTTCGAG TGTTACGGAG CCAACCTGAA TCTGGCCCTT CAGTTACCCG TCGATGCCCT TCACCTCGAT CTGGTCCGTT GTCCGAACCA ACTAGCCGAT ATCCTGACTA CTGATTTTGT TCAGCGCCCT ACCCAACTGT CGCTGGGTGT GGTAGACGGC CGAAACATCT GGATCAACGA CCTGGAAAAA TCGCTGGCTA CGGTTCGGCA AGCTGTAGCA GCGCTGGGTG CCGACCGAAT TCTGATTGCG CCCTCCTGCT CCCTGCTACA CGTGCCCTGC GACTTATCGA CCGAAACGGC TGAAGCCGGA TTACCGTCCG AACTCAACCA ATGGCTGGCT TTTGCCCGCC AGAAACTCGA TGAAGTGGTC ACCCTCGTCA CACTCAGCCA GGAACCCGAC AATCCGCAGG CGTCTGCTTT GCTGGTGACT AATCGGGCTG CTTTACAGGC CCGGCGGGAA TCGAATCAGA TTAATCGACC GGCGGTACAG CAGCGGCTAC GTCAGCTAAG TGAGCAGGAC GCCCGTCGAG CGAGTGACTT TTCGCAGCGT CAGGCAATAC AGCATGACCG GCTCAGATTA CCCCTTTTTC CGACGACGAC AATTGGTTCA TTCCCCCAAA CAGATGCCGT GCGGGCCAAT CGGGCAAAAT TCAAGAAAGG AGAAAAGACG CTGGCCGAGT ACGAAGCGTT CATCCGGTCG GAGACGGAGC AGGCCATCCG CTGGCAGGAA GACGTTGGCC TGGATGTACT GGTGCACGGG GAGTTCGAGC GCAACGACAT GGTAGAGTAT TTTGGAGAGT TACTCGACGG CATTGCCTTC AGCCAGAACG GCTGGGTGCA GAGTTATGGT TCGCGGTGCG TTAAACCGCC CATCATTTTT GGCGATGTTC ACCGGCCCGA ACCCATGACC GTACGTTGGG CTCAACTGGC TCAATCACTG ACCGACAAGC CCGTTAAAGG CATGCTCACC GGCCCTGTTA CCATTCTGCA ATGGTCGTTC GTCCGGGATG ACCAACCCCG GCGCGACACC TGTTTGCAAC TGGCCCTAGC TATTCATGAC GAAGTCGTGG ATCTGGAAGC GGCCGGCATC GGCATTATCC AGATTGACGA GCCCGCCCTG CGCGAAGGAT TACCACTGCG CCGTGAAAAC TGGGATACTT ACTTAAACTG GGCGGTCATG GCGTTTCGGG TGGCGGCATC GGGTGTGCAG GACCACACCC AGATCCATAC CCACATGTGT TACGCGGAGT TTAACGACAT CATTAGCGCC ATTGCCGATC TGGATGCCGA TGTCATCACC ATCGAAACTT CCCGCTCGCA GATGGAGCTT CTGGATGCAT TCGCCCGTTT TGCGTACCCG AACGAAATAG GGCCTGGGGT GTATGATATT CACAGCCCGA GTGTACCAAC CGTGGCCGAA ATGGTTGAGC TTCTTCGGCA GGCAACTCAG GTACTGCCCG CCCGGAACTT GTGGGTTAAC CCCGATTGCG GCCTGAAAAC GCGCCGGTGG CCAGAGACGA CCGAAGCCCT GAAAAACATG GTACAGGCGG CCCAAACCGC CCGTGAGACC ATACTGGAAG CGGTGTAA
|
Protein sequence | MIAHNLGYPR IGSHRELKRA CEQFWAKKIT PQQLQEAGRQ LRQANWLTQQ QAGIALVPCN DFSFYDQVLD MSLTVGAIPK RFKPLLDEPT AYPLALYFAM ARGYQQDGLD LKAMEMTKWF DTNYHYIVPE FSADQSFSLL SEKIFTEFDE ARETLDTLPK PVLLGPVSYL LLGKEKEPGF DRLTLLDRLL PVYKQILGRL QQQGAEWVQF DEPCLSLDLT PHQQEAIASV YGQLRQQFPN LKLLLAAYFE CYGANLNLAL QLPVDALHLD LVRCPNQLAD ILTTDFVQRP TQLSLGVVDG RNIWINDLEK SLATVRQAVA ALGADRILIA PSCSLLHVPC DLSTETAEAG LPSELNQWLA FARQKLDEVV TLVTLSQEPD NPQASALLVT NRAALQARRE SNQINRPAVQ QRLRQLSEQD ARRASDFSQR QAIQHDRLRL PLFPTTTIGS FPQTDAVRAN RAKFKKGEKT LAEYEAFIRS ETEQAIRWQE DVGLDVLVHG EFERNDMVEY FGELLDGIAF SQNGWVQSYG SRCVKPPIIF GDVHRPEPMT VRWAQLAQSL TDKPVKGMLT GPVTILQWSF VRDDQPRRDT CLQLALAIHD EVVDLEAAGI GIIQIDEPAL REGLPLRREN WDTYLNWAVM AFRVAASGVQ DHTQIHTHMC YAEFNDIISA IADLDADVIT IETSRSQMEL LDAFARFAYP NEIGPGVYDI HSPSVPTVAE MVELLRQATQ VLPARNLWVN PDCGLKTRRW PETTEALKNM VQAAQTARET ILEAV
|
| |