Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_2186 |
Symbol | |
ID | 3934640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 2191793 |
End bp | 2194822 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637904543 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_510128 |
Protein GI | 89054677 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCC GTCTGAAACC TCAAGCCGCG TCCAAATGGG GGCGTCTGAT CGACCGTGAC ACATCGGTGA AATTCACCTT CAACGGCAAG TGGATGCGCG GCCATGAGGG CGATACGTTG GCCTCGGCGC TTCTGGCCAA TGGTCAGATG CTGGTGGGCC GCTCGTTCAA GTATCACCGC CCGCGCGGGA TCATGTCCTC AGGCGCGGAG GAGCCCAACG CGCTGGTCAA TCTCGGCTCC GGCGTCACCC ATGAGCCGAA CCAGCGCGCC ACCACCACCG AGCTGTTTGA GCGGTTGGAG GCCAGGTCCC AGAACCACTG GCCGTCATTG GCCTATGACA TCGGGGCCGT GAACCAATTG GTCTCCCGGT TCCTGCCTGC GGGCTTTTAT TACAAGACAT TCCTGTTCCC GCGGGCGGCG TGGAAACATG TGTTCGAGCC GTTCATCCGC CAATCGGCGG GCCTTGGCCA AGCGCCGCAT CCCGAAACGC GCGACCCGGA CACCTACGAG CACTTTTATG CCCATGTCGA TCTGGTTGTC GCGGGCGGTG GCATTGCAGG GTTGCAGGCC GCGCTGCTCG CGGGCCGTGC CGGGGCCAGC GTTCTGTTGA TGGAGCAGAC GGCCCATTGG GGCGGGCGGG CCCCTGTGGA TGGCGTTGAA GTTGACGGTG TACCGGCTGA GGATTGGGTG AAAGACGCCG TACAAACCCT TGAGGCGATG GAGAATGTGA CGATCCGGAC CCGCGCGATG GTGGCCGGCG TCTATGACCA TGGCTATGTG CTGGGATATG AACGGCTGAC GGATCACGCA CCGGATCAGG ATGGGCCGCG CCACCGTCTC TGGCGCATTC GGGCGCGGCG GATCATTTCG GCGACCGGTG CGCTGGAGCG TCCGCTCAGC TTTTCCGGCA ACGACAAGCC GGGGGTCATG CTGGCCTCGG CCATGCGCGA CTATCTGGTG AATTACGGCG TGGCGGTCGG CGAGAACATC GTTGTTGTGA CCAACAACGA TGACGGCTAC CGCACGGCGA TTGCACAGGT GGAGGCCGGA TTGTCCGTCG CCTGCGTGGT CGATGCCCGA CCAAGTGCGA CCGGCGAATT ACCGAACAAA GCCCGTGCCT TGGGGATCAA TGTTAAAGTA AACTGCGCCG TTGCCGGTGT GAAAGGCCTG CGCAATGTTG TGTCAGTGTC GCTATGCCCC CAGGATGGAG ATGGTTTAAC CGTTGAACAA ATTCCCTGTG ATGCGGTGGC GATGGCAGGC GGCTGGTCTC CGGTCGTGCA TCTCTGGTCC CATTGCGGCG GCAAATTGAC ATGGGATGAG GGCGCGTCGT TCTTCCGCCC CGACCATGAC CGCCCACCGC TTGATCAGAA TGGGGATGGT TTCGTGACCT GCGTCGGCAG CGCAGACGGA GCGATGTTGG CGAGCGAAAC CTTAACAAAC ACTTTAATAA ACATCGACAA TGCGCTGAAA AGTATTGATT TACTCGCCGC TGACGGAGCA ACGGCGAGGA GCGACAGGGA GGAGCCGCTG GCCCCGGTCT GGGTCATGCC GAAGGCGGCG GACTACAAGA AGCGCTCGAA GATGTGGCTC GATTTCCAGA ACGACGTGAA AGTGTCCGAC GTCCAGCTCG CGGCCCGGGA GGGATATGAG AGCGTCGAGC ACACCAAGCG CTACACGACC CTCGGCATGG CGACCGATCA GGGGAAGCTG AGCAACATCA ACGGATTGGC CGTCCTTGCT GATGCACTGA ATGAAAGCAT CCCGCGCGTC GGCACGACCA CATTCCGCCC GCCTTACACA CCCATTTCCA TGGGGGCCAT TGCCGGTGAG GCGAGCCGCG AGATCTTCCA ACCCCTCCGC CGCACCCCGA TGCATGACTG GCACGAGGCC CAGGGCGCGT ATATGGAGCC CGTGGGCGGC TGGCGCAGGC CCTATTGCTT CCCGCAAGCT GGCGAGACCC ATGAGCAGGC CGTGAACCGG GAGATCGACG CCACGCGCGG CTCGCTCGGT CTGCTGGACG CCTCCACCCT CGGCAAGATC ATCGTCAAAG GCCCCGATGC GCCGAAATTC ATGGATATGC TCTACACCAA CATGATGAGC AGCCTGAAAC CGGGCAAATG TCGCTATGGC CTGATGTGCT CTGAGAATGG CTTCCTGATG GATGACGGCG TGGTCGTCCG GCTGGATGAG GACACGTTCC TGGCCCACAC GACCTCGGGC GGGGCCGATC ACGTCCATGC TCATATGGAG GATTGGCTGC AATGCGAGTG GTGGGACTGG AAGGTCCACA CCGTCAACGT GACCGAGCAA TGGGCGCAGG TCGCCGTTGT GGGCCCCAAC GCACGCAAAC TGCTGGACAA ACTGGGGGCG GATTTTAACC TGTCTGCCGA CGCACTGCCC TTCATGGGGA TGGCGGAGGG TAAGATTGGC GGCTTTGATG CGCGTGTTTT CCGCATCTCC TTCTCAGGCG AGTTGTCCTA CGAGATTGCC GTTCCCGCCT CCCAAGGCAT AGCATTCTGG GAGGCGTTGC ATGCGGCGGG CGCGGAATGG GACGCGACGC CCTACGGCAC AGAAGCGCTT CACGTGATGC GCGCTGAGAA GGGCTTTATC ATGATCGGGG ACGAGACGGA CGGCACCGTC ATCCCGCAGG ACCTTGGCCT CAACTGGGCG ATCTCCAAGA AGAAAGAGGA TTTTTTGGGC AAACGCGGGC AGGAGCGGAC GTATATGGTC GACCCCAACC GCTGGAAGCT GGTGGGGCTG GAGACAGCCG ACAAATCCGT GCTGCCCGAC GGCAGTTACG CCGTGGCCAA GGGCACAAAC GCAAACGGAC AGGCGAATGT GGAGGGGCGC GTGACGTCGA CCTATTATTC GCCGACTGTC AAACGCGGCA TTGCCATGGG GCTGGTCCTG AACGGGCCGG ACCGGATGGG AGAGACTGTT CAGTTCAACA AGGTGGATGG ATCGACTGTG GCCGCCAAAA TCGTGAACCC GGTGTTCTTT GACCCGGACG GGGAGAAGCA GAATGTCTGA
|
Protein sequence | MSTRLKPQAA SKWGRLIDRD TSVKFTFNGK WMRGHEGDTL ASALLANGQM LVGRSFKYHR PRGIMSSGAE EPNALVNLGS GVTHEPNQRA TTTELFERLE ARSQNHWPSL AYDIGAVNQL VSRFLPAGFY YKTFLFPRAA WKHVFEPFIR QSAGLGQAPH PETRDPDTYE HFYAHVDLVV AGGGIAGLQA ALLAGRAGAS VLLMEQTAHW GGRAPVDGVE VDGVPAEDWV KDAVQTLEAM ENVTIRTRAM VAGVYDHGYV LGYERLTDHA PDQDGPRHRL WRIRARRIIS ATGALERPLS FSGNDKPGVM LASAMRDYLV NYGVAVGENI VVVTNNDDGY RTAIAQVEAG LSVACVVDAR PSATGELPNK ARALGINVKV NCAVAGVKGL RNVVSVSLCP QDGDGLTVEQ IPCDAVAMAG GWSPVVHLWS HCGGKLTWDE GASFFRPDHD RPPLDQNGDG FVTCVGSADG AMLASETLTN TLINIDNALK SIDLLAADGA TARSDREEPL APVWVMPKAA DYKKRSKMWL DFQNDVKVSD VQLAAREGYE SVEHTKRYTT LGMATDQGKL SNINGLAVLA DALNESIPRV GTTTFRPPYT PISMGAIAGE ASREIFQPLR RTPMHDWHEA QGAYMEPVGG WRRPYCFPQA GETHEQAVNR EIDATRGSLG LLDASTLGKI IVKGPDAPKF MDMLYTNMMS SLKPGKCRYG LMCSENGFLM DDGVVVRLDE DTFLAHTTSG GADHVHAHME DWLQCEWWDW KVHTVNVTEQ WAQVAVVGPN ARKLLDKLGA DFNLSADALP FMGMAEGKIG GFDARVFRIS FSGELSYEIA VPASQGIAFW EALHAAGAEW DATPYGTEAL HVMRAEKGFI MIGDETDGTV IPQDLGLNWA ISKKKEDFLG KRGQERTYMV DPNRWKLVGL ETADKSVLPD GSYAVAKGTN ANGQANVEGR VTSTYYSPTV KRGIAMGLVL NGPDRMGETV QFNKVDGSTV AAKIVNPVFF DPDGEKQNV
|
| |