Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3999 |
Symbol | |
ID | 4694046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 4389510 |
End bp | 4391327 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639851748 |
Product | sulfoacetaldehyde acetyltransferase |
Protein accession | YP_998724 |
Protein GI | 121610917 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR03457] sulfoacetaldehyde acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.422194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0161793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA CCCCCCCCGC CCCCCGCCAA GCCGTCACCG GCGTGCAGAA GATGACGCCT TCGGAGGCCC TCGTCGAGAC CCTGGTCGCC AATGGCGTGA CCGACATCTT CGGCATCATG GGCTCGGCCT TCATGGACGC AATGGATATC TTCGCCCCCG CCGGCATCCG CCTGATTCCG GTGGTGCACG AGCAGGGGGG CGCCCATATG GCCGATGGCT ATGCGCGGGT TTCCGGGCGC CACGGCCTGG TGATCGGCCA GAACGGCCCC GGCATCAGCA ACTGCGTGAC GGCGATCGCG GCCGCTTACT GGGCGCACAG CCCGGTGGTG TTGATCACGC CCGAGATTGG CACCATGGGC ATGGGCCTGG GCGGCTTTCA GGAGGCCAAC CAACTGCCGA TGTTCCAGGA GTTCACCAAG TACCAGGGCC ATGTGAACAA CCCCCGGCGC ATGGCCGAGT ACGCCGCGCG CTGCTTTGAC CGCGCCATCT CCGAGATGGG TCCGACGCAG TTGAACATTC CGCGCGACTA CTTCTACGGC GAGATCAGCA CCGAGATTGC GCAGCCGATG CGCGTGGAGC GCGGCGCGGG CGGCGAGAAC AGCCTCGACG CAGCGGTCGA GTTGCTGGCT GCGGCCAGGT TTCCGGTGAT CCTCTCCGGC GGCGGCGTGG TGATGGGCGA TGCGGTGGCC CAATGCCAGG CGCTGGCCGA ACGCCTGGGC GCGCCGGTGG CCAACGGCTA CTTGCGCAAC GACTCCTTTC CGGCGAGCCA CCCGCTGTGG GTCGGCCCGC TGGGCTACCA AGGCTCCAAG GCGGCGATGA AACTGATCGC GCAGGCCGAC GTGGTGCTCG CGCTCGGCTC GCGCATGGGC CCGTTCGGCA CGCTGGCGCA GTACGGCATC GACTACTGGC CCAAGGACGC GAAGATCATC CAGGTCGAGG CCGACCACAC CAACCTGGGG CTGGTCAAGA AGATCACCGT GGGCATCCAC GGCGATGCCA AGGCCAGCGC CCGGGAACTG CTCAGGCGCC TGCAGGGCAG GGCGCTGGCC TGCGACGCCA ACCGGGCCGG GCGCGCCCGG ACGATCGGGG CCGAGAAGGC CGCGTGGGAG AAAGAGCTCG ACGAATGGAC CCACGAGCGC GACCCGTTCA GCCTGGACGC GATCGAGGAG GCCAGGGGGG AGAAAACCGC CACCGGCGGC AACTACCTGC ACCCGCGCCA GGTGCTGCGC GAGCTTGAAA AAGCCATGCC GCCGCGCGTG ATGGTGGCCA CCGACGTGGG CAACATCAAC GCCATTGCCA ACAGCTACCT GCGTTTCGAG GAGCCGCGCT CGTTCTTCGC GCCGATGAGC TTTGGCAACT GCGGCTATGC GCTGCCGACG GTGATCGGCG CCAAGTGCGC CGCGCCCGAC CGGCCGGCGC TGGCCTACGC CGGCGATGGC GCCTGGAGCA TGAGCATGGT CGAGGTGATG ACGGCCGTGC GCCACGACAT TCCGGTGACG GCGGTGGTGT TCCATAACCG CCAGTGGGGC GCGGAGAAGA AGAACCAGGT CGATTTCTAC AACCGCCGCT TCGTTGCCGG CGAACTCGAC AAGCAGAACT TCGCGGGCAT CGCCCGTGCC ATGGGCGCCG AGGGCATCGT GGTCGACCAG TTGCAGGACG CAGGCCCGGC GCTGAAGAAG GCCATCGACC TGCAAATGAA CCAGCGCAAG ACCTGCGTGA TCGAGGTCAT GTGCACCCGT GAACTGGGCG ACCCCTTCCG GCGCGATGCG CTGTCCAAGC CGGTGCGCTT GCTGCCCCGG TACAAGGACT ATGTCTGA
|
Protein sequence | MSQTPPAPRQ AVTGVQKMTP SEALVETLVA NGVTDIFGIM GSAFMDAMDI FAPAGIRLIP VVHEQGGAHM ADGYARVSGR HGLVIGQNGP GISNCVTAIA AAYWAHSPVV LITPEIGTMG MGLGGFQEAN QLPMFQEFTK YQGHVNNPRR MAEYAARCFD RAISEMGPTQ LNIPRDYFYG EISTEIAQPM RVERGAGGEN SLDAAVELLA AARFPVILSG GGVVMGDAVA QCQALAERLG APVANGYLRN DSFPASHPLW VGPLGYQGSK AAMKLIAQAD VVLALGSRMG PFGTLAQYGI DYWPKDAKII QVEADHTNLG LVKKITVGIH GDAKASAREL LRRLQGRALA CDANRAGRAR TIGAEKAAWE KELDEWTHER DPFSLDAIEE ARGEKTATGG NYLHPRQVLR ELEKAMPPRV MVATDVGNIN AIANSYLRFE EPRSFFAPMS FGNCGYALPT VIGAKCAAPD RPALAYAGDG AWSMSMVEVM TAVRHDIPVT AVVFHNRQWG AEKKNQVDFY NRRFVAGELD KQNFAGIARA MGAEGIVVDQ LQDAGPALKK AIDLQMNQRK TCVIEVMCTR ELGDPFRRDA LSKPVRLLPR YKDYV
|
| |