Gene Veis_3999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_3999 
Symbol 
ID4694046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4389510 
End bp4391327 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content68% 
IMG OID639851748 
Productsulfoacetaldehyde acetyltransferase 
Protein accessionYP_998724 
Protein GI121610917 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR03457] sulfoacetaldehyde acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.422194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0161793 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA CCCCCCCCGC CCCCCGCCAA GCCGTCACCG GCGTGCAGAA GATGACGCCT 
TCGGAGGCCC TCGTCGAGAC CCTGGTCGCC AATGGCGTGA CCGACATCTT CGGCATCATG
GGCTCGGCCT TCATGGACGC AATGGATATC TTCGCCCCCG CCGGCATCCG CCTGATTCCG
GTGGTGCACG AGCAGGGGGG CGCCCATATG GCCGATGGCT ATGCGCGGGT TTCCGGGCGC
CACGGCCTGG TGATCGGCCA GAACGGCCCC GGCATCAGCA ACTGCGTGAC GGCGATCGCG
GCCGCTTACT GGGCGCACAG CCCGGTGGTG TTGATCACGC CCGAGATTGG CACCATGGGC
ATGGGCCTGG GCGGCTTTCA GGAGGCCAAC CAACTGCCGA TGTTCCAGGA GTTCACCAAG
TACCAGGGCC ATGTGAACAA CCCCCGGCGC ATGGCCGAGT ACGCCGCGCG CTGCTTTGAC
CGCGCCATCT CCGAGATGGG TCCGACGCAG TTGAACATTC CGCGCGACTA CTTCTACGGC
GAGATCAGCA CCGAGATTGC GCAGCCGATG CGCGTGGAGC GCGGCGCGGG CGGCGAGAAC
AGCCTCGACG CAGCGGTCGA GTTGCTGGCT GCGGCCAGGT TTCCGGTGAT CCTCTCCGGC
GGCGGCGTGG TGATGGGCGA TGCGGTGGCC CAATGCCAGG CGCTGGCCGA ACGCCTGGGC
GCGCCGGTGG CCAACGGCTA CTTGCGCAAC GACTCCTTTC CGGCGAGCCA CCCGCTGTGG
GTCGGCCCGC TGGGCTACCA AGGCTCCAAG GCGGCGATGA AACTGATCGC GCAGGCCGAC
GTGGTGCTCG CGCTCGGCTC GCGCATGGGC CCGTTCGGCA CGCTGGCGCA GTACGGCATC
GACTACTGGC CCAAGGACGC GAAGATCATC CAGGTCGAGG CCGACCACAC CAACCTGGGG
CTGGTCAAGA AGATCACCGT GGGCATCCAC GGCGATGCCA AGGCCAGCGC CCGGGAACTG
CTCAGGCGCC TGCAGGGCAG GGCGCTGGCC TGCGACGCCA ACCGGGCCGG GCGCGCCCGG
ACGATCGGGG CCGAGAAGGC CGCGTGGGAG AAAGAGCTCG ACGAATGGAC CCACGAGCGC
GACCCGTTCA GCCTGGACGC GATCGAGGAG GCCAGGGGGG AGAAAACCGC CACCGGCGGC
AACTACCTGC ACCCGCGCCA GGTGCTGCGC GAGCTTGAAA AAGCCATGCC GCCGCGCGTG
ATGGTGGCCA CCGACGTGGG CAACATCAAC GCCATTGCCA ACAGCTACCT GCGTTTCGAG
GAGCCGCGCT CGTTCTTCGC GCCGATGAGC TTTGGCAACT GCGGCTATGC GCTGCCGACG
GTGATCGGCG CCAAGTGCGC CGCGCCCGAC CGGCCGGCGC TGGCCTACGC CGGCGATGGC
GCCTGGAGCA TGAGCATGGT CGAGGTGATG ACGGCCGTGC GCCACGACAT TCCGGTGACG
GCGGTGGTGT TCCATAACCG CCAGTGGGGC GCGGAGAAGA AGAACCAGGT CGATTTCTAC
AACCGCCGCT TCGTTGCCGG CGAACTCGAC AAGCAGAACT TCGCGGGCAT CGCCCGTGCC
ATGGGCGCCG AGGGCATCGT GGTCGACCAG TTGCAGGACG CAGGCCCGGC GCTGAAGAAG
GCCATCGACC TGCAAATGAA CCAGCGCAAG ACCTGCGTGA TCGAGGTCAT GTGCACCCGT
GAACTGGGCG ACCCCTTCCG GCGCGATGCG CTGTCCAAGC CGGTGCGCTT GCTGCCCCGG
TACAAGGACT ATGTCTGA
 
Protein sequence
MSQTPPAPRQ AVTGVQKMTP SEALVETLVA NGVTDIFGIM GSAFMDAMDI FAPAGIRLIP 
VVHEQGGAHM ADGYARVSGR HGLVIGQNGP GISNCVTAIA AAYWAHSPVV LITPEIGTMG
MGLGGFQEAN QLPMFQEFTK YQGHVNNPRR MAEYAARCFD RAISEMGPTQ LNIPRDYFYG
EISTEIAQPM RVERGAGGEN SLDAAVELLA AARFPVILSG GGVVMGDAVA QCQALAERLG
APVANGYLRN DSFPASHPLW VGPLGYQGSK AAMKLIAQAD VVLALGSRMG PFGTLAQYGI
DYWPKDAKII QVEADHTNLG LVKKITVGIH GDAKASAREL LRRLQGRALA CDANRAGRAR
TIGAEKAAWE KELDEWTHER DPFSLDAIEE ARGEKTATGG NYLHPRQVLR ELEKAMPPRV
MVATDVGNIN AIANSYLRFE EPRSFFAPMS FGNCGYALPT VIGAKCAAPD RPALAYAGDG
AWSMSMVEVM TAVRHDIPVT AVVFHNRQWG AEKKNQVDFY NRRFVAGELD KQNFAGIARA
MGAEGIVVDQ LQDAGPALKK AIDLQMNQRK TCVIEVMCTR ELGDPFRRDA LSKPVRLLPR
YKDYV