Gene Information |
Locus tag | Veis_4048 |
Symbol | |
ID | 4695323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 4437361 |
End bp | 4440378 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639851795 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_998771 |
Protein GI | 121610964 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
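The coordinates, gene length, and protein length listed above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and a complete CDS whose final codon is a stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4437361, 4440378)
residues = protein_length_aa(length)
print(length, residues)  # prints "3018 1005"
```

Both results agree with the Gene Length (3018 bp) and Protein Length (1005 aa) fields.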
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.802614 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCA CCAGGGTCAG GCAGCCGGCC GCCCGCATCG ACGGCTCGCG CCAGTTGCGC TTTAGCTTCA ACGGCCGCGA CTACACCGGC CACCCCGGCG ACACGCTGGC CTCGGCGCTG CTCGCGCAAG GGGTGCGGTG CGTGGCGCGC AGCTTCAAGT ACGGGCGGCC GCGCGGCATC ATCGGCGCCG GCGCAGAAGA GCCGAATGCG CTGGTGCAGC TCGGCGTGGG CGCGCTCACG ACGCCGAACG TCAAAGCCAC GCAGGCCGAG CTGTATGAGG GCTTGGTCGC CCATTCCACG TCGGGCTGGC CGGCGCTGGC TTTCGACCTG AAATCGCTGC TCGGCCGGGG CGCGCGCTCC ATGATGCCGG CCGGGTTCTA CGGCAAGACC TTCAAATGGC CGCGCCGGCT GTGGCCGCTG TACGAGGCGG TGCTGCGCCG GTGCGCCGGC TGGGGCGCGG CACCCGGGTT GCCCGACCCC GAGCGCTACG ACCACTTGCA CCACCATGTG GATGTGCTGG TGGTCGGCGC CGGCGCCTGC GGCCTGCTGG CCGCGTTGCA GGCCGGGCAG GCGGGCCTGA AGACCTTGCT GCTCGACGAG CAAAACGAGT TGGGCGGCTG GCTGTTGTCC GACCCGCGGG CGCGCATCGA CGGCCGCGAC GGCCCGGCCT ACATCCGCTC GGTGCAGTCC GCCTTGGCGG GCTTGCCGCA GGTGCGCGTG CTGACCCGCA CCACCGCCTT CGGCATGTAC GAGCACAACC TGGTGCAAGC GGTCGAACTG GTGCAAGACC ATATCGCCCC GGCCGAGCGC CAGGCCCATC TGCCGCGCCA GCGCCTGCAC AAGATCCGCG CGCGCCAGGT GGTTCTGGCC ACCGGCGCCA TCGAGCGCCC GCTGGTCTTT GGCAACAACG ACCTGCCGGG CGTGATGACA GTCTCTGCCG GGCAGACTTT TTTGCAGCGC TACGGCGTGC GGGTCGGGCA GCGCGTGGTG ATTTGCGGCA CCAGCGATCT GATCCACGAT TGCGCCGAAG ACCTGGCCCA GGCCGGCGCC CGCGTCGTCG TGGCCGATGT GCGCCATGGC GTGACCGCCC GCAGCAGCGC CTACCAGGTG TTGGGCGGCC ACGGCATTGC CCGGGCCATG GGCCGCGGCC ACGTCAAAAG CGCGCACCTG GTGCCGCTGC ACGCCACGCG CGAGGAGGCC ACAAGCGCGG GCCGGCATGT GGCCTGCGAC GTGGTGCTCA GTTCTGGCGG GCTGTCGCCA ACGGTGCATC TGTTTTGCCA TGACGGCAGC CGCCCGCTCT GGGACGACGC AGCGGCGGCC TTCGTGGCCC CCGGCACCGG GCGCCCGGGC GTGGCCTGCG TCGGGGCGGT CACCGGCGCG TTCGAATTGC CGGCGGCGCT GGCGCAGACC ACACAGGCCA TGCACCGGGT GCTGGCCGCC TGCGGCCGGC AGCGGTCATT GCAGACACCG GTTTGCCCGC CCCCGCCCCA ACGACGGGCG GCGCGGCCGA TGTTCCTGAT GCCCTCCTGC TGCGCGTCCG ATGGCAAGCG GGCCAAGCTC CATGCCAAGG CTTTCGTCGA CTACCAAAAC GACGTGACCG CCGCCGACAT CGGGTTGGCC GTGCGCGAGA ACTACCACAG CATCGAGCAT GTCAAGCGCT ACACCGCGCT GGGTTTCGGC ACCGACCAGG GCAAGCTGTC GAACGTCAAC GGCGTGGTGC TGACCGCGCG CGCGCTGCAG CGCCCGGTCG GCGAGGTCGG CACCACCACC TACCGCCCGG CCTACACCCC GGTGAGCCTG 
GGCGCGCTGG CCGGGACGAT GGTGCAAGAC TGCTTCGACC CGAGCCGCTA CACCGCGCTG CATGAGGCCC ATGTGGCGCG CGGCGCGGCG CTGGAGCCGG TGGGCCAGTG GCTGCGGCCC TGGTATTTCG CCCGCGCCGG CGAGGACCTG CGCGCTGCCG TGAACCGCGA ATGCCTGGCG GCGCGCCATG GGGTGGCGCT GATGGACGCC TCGACGCTGG GCAAGATACA GATCGACGGC CCGGACGCGC GCGAATTCCT CAACCGCATC TACGCCAACG CCTGGAGCCA GTTGGCGGTG GGCAAATGCC GCTACGGCCT GATGCTCGAT GAAAACGGCA TGGTGATGGA CGACGGCGTG ACCGCCTGCA TCACGCCCCG GCAGTTCTAC ATGACCACCA CCACCGGCGG CGCCGCCCGG GTGCTGAACT GGCTCGAACG CTGGCACCAG ACCGAGTGGC CCGAGCTGAA GGTCTGGATG ACCTCGGTGA CCGACCACTG GACGACCATC GCCCTGGTCG GCCCCAAGGC CCGGACGGTG CTGGCCAGGC TGTGCCCGGA CATCGACCTG CGCGCCGACA GCTTCCAATT CATGGACTGG CGCGCCGGCA CGGTGCATGG CCTGCCGGCC CGGGTGTTTC GCATCAGCTT TTCCGGCGAA CTGGCGTATG AGCTGAATGT CGAATCCGGC TACGGCCACG CGCTGTGGGA AGCCGTGATG GCCGCCGGCG CCGAGTTCGA CATCACGCCC TACGGCACCG AGACCATGCA TGTGCTGCGC GCGGAAAAGG GTTTCATCAT CGTCGGCCAG GACACCGACG GCTCGATCAG CCCGCTGGAC CTGGGGATGG GCTGGGCCGT GGGCATGAAA AAAACGTACA GCTTCCTGGG CAAACGCTCG CTGGCGCGCA GCGACACCGC GCGCGACGAC CGCAAGCAGT GGGTCGGCCT GCTGACGCAA GACCCGTCGG TGGTGCTGCC CGAAGGCGCG CAAATCATGG ACAGCGCCCG CACCGGCGCG CACAACCGGA TGCTCGGCCA TGTGACTTCC AGCTACCACA GCGCCTTCCT GGGCCGCTCC ATCGCGCTGG CCGTGGTCGC TGCGGGCCGG CAGCGGATCG GCCAGACCCT GTATGCGCAC GCCCATGGAC GCGCCACCGC CGCGCAGGTG GTCGGCAGCG TGTTCGTCGA CCCGAAGGGG GAGCGACAAA ATGTCTGA
Protein sequence | MSGTRVRQPA ARIDGSRQLR FSFNGRDYTG HPGDTLASAL LAQGVRCVAR SFKYGRPRGI IGAGAEEPNA LVQLGVGALT TPNVKATQAE LYEGLVAHST SGWPALAFDL KSLLGRGARS MMPAGFYGKT FKWPRRLWPL YEAVLRRCAG WGAAPGLPDP ERYDHLHHHV DVLVVGAGAC GLLAALQAGQ AGLKTLLLDE QNELGGWLLS DPRARIDGRD GPAYIRSVQS ALAGLPQVRV LTRTTAFGMY EHNLVQAVEL VQDHIAPAER QAHLPRQRLH KIRARQVVLA TGAIERPLVF GNNDLPGVMT VSAGQTFLQR YGVRVGQRVV ICGTSDLIHD CAEDLAQAGA RVVVADVRHG VTARSSAYQV LGGHGIARAM GRGHVKSAHL VPLHATREEA TSAGRHVACD VVLSSGGLSP TVHLFCHDGS RPLWDDAAAA FVAPGTGRPG VACVGAVTGA FELPAALAQT TQAMHRVLAA CGRQRSLQTP VCPPPPQRRA ARPMFLMPSC CASDGKRAKL HAKAFVDYQN DVTAADIGLA VRENYHSIEH VKRYTALGFG TDQGKLSNVN GVVLTARALQ RPVGEVGTTT YRPAYTPVSL GALAGTMVQD CFDPSRYTAL HEAHVARGAA LEPVGQWLRP WYFARAGEDL RAAVNRECLA ARHGVALMDA STLGKIQIDG PDAREFLNRI YANAWSQLAV GKCRYGLMLD ENGMVMDDGV TACITPRQFY MTTTTGGAAR VLNWLERWHQ TEWPELKVWM TSVTDHWTTI ALVGPKARTV LARLCPDIDL RADSFQFMDW RAGTVHGLPA RVFRISFSGE LAYELNVESG YGHALWEAVM AAGAEFDITP YGTETMHVLR AEKGFIIVGQ DTDGSISPLD LGMGWAVGMK KTYSFLGKRS LARSDTARDD RKQWVGLLTQ DPSVVLPEGA QIMDSARTGA HNRMLGHVTS SYHSAFLGRS IALAVVAAGR QRIGQTLYAH AHGRATAAQV VGSVFVDPKG ERQNV
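The protein sequence can be spot-checked against the gene sequence by translating a few codons. The sketch below is illustrative only: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers rather than part of any library.

```python
# Codon table in the conventional TCAG ordering used for genetic-code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 nt and last 18 nt of the gene sequence in this record.
print(translate("ATGAGCGGCA CCAGGGTCAG GCAGCCGGCC"))  # prints "MSGTRVRQPA"
print(translate("GAGCGACAAA ATGTCTGA"))               # prints "ERQNV"
```

The two outputs match the first ten and last five residues of the protein sequence. The 18-nucleotide tail is in frame because the gene length, 3018 bp, is a multiple of 3.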