Gene Veis_2021 details


Gene Information

Locus tag: Veis_2021
Symbol:
ID: 4692977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 2297581
End bp: 2298459
Gene Length: 879 bp
Protein Length: 292 aa
Translation table: 11
GC content: 73%
IMG OID: 639849787
Product: HAD family hydrolase
Protein accession: YP_996791
Protein GI: 121608984
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459
            [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
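
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2297581
END_BP = 2298459


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (879 bp) and Protein Length (292 aa) fields.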


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.455576
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGCGCT GGGCCGCAGG CTTGGCGGAA CTGGCCCCGC ATTTCGACGG CTTCGTGCTC 
GACCAGTTCG GCGTGCTCCA TGACGGCCAG GCGCCATACC CCGGCGTTGC CGACGCGCTG
CGCCAACTGC GCGCCCATGC CAAGCGCGTG CTGGTGCTGA GCAACTCCGG CAAGCGCGCC
GCGTACAACC GCCAGCGGCT GGCAGGCTTT GGCATCACGC CCGGGCTGTA CGACGACCTC
ATCAGTTCCG GCGAACTATG CCGGCAGATG CTCGCGCGCC GCGACCGCGC GCCCTGGGCC
ACGCTGGGCC GGCGCGTGCT GCTGCTCGAC CCCGGGCAGG ACCGGCCGCT GATCGACGCG
CTGGCGCTCG ATGCCGTCGA TAGCGTCGAG CAGGCCGACT TCATCCTGCT GGCCAGCCTG
GCCGACGGCA TGCAGCCGGC CAGCCTGCAA GCGCTGCTCG ACGCGGCCGC AGCGCGCCGC
CTGCCGCTGG TCTGCGCCAA CCCCGACCGG CAGCGCCTGA CGCTGCATGG CATAGCCCCC
GGCAGTGGCA GCGTGGCCGC GCATTACGAA CAAATGGGCG GCATGGTGGT CTGGGTGGGC
AAGCCGTATC CGCTGATTTA CGCCGCCTGC CGCGAGCGGC TGGCCGGTCT CGGCGCCGAG
CGCATCTGCG CGCTCGGCGA CTCGATCGAG CATGACCTGC TCGGCGGCAG CCGCGCCGGG
CTGGCCACCT GCTTTGTCGC CGGCGGCCTG CATGCGCAGG ACTTCGAGCG CGCCGGCGCC
GCGAACCGCG CTGCCGAACT GCAACGCCTG CTCGCGCTGC CCGCTGCCCA TGGCGCGCCC
GCGCCGGCCT GGGCGCTGCC GCGCCTGCAA TGGAGCTAG
 
Protein sequence
MTRWAAGLAE LAPHFDGFVL DQFGVLHDGQ APYPGVADAL RQLRAHAKRV LVLSNSGKRA 
AYNRQRLAGF GITPGLYDDL ISSGELCRQM LARRDRAPWA TLGRRVLLLD PGQDRPLIDA
LALDAVDSVE QADFILLASL ADGMQPASLQ ALLDAAAARR LPLVCANPDR QRLTLHGIAP
GSGSVAAHYE QMGGMVVWVG KPYPLIYAAC RERLAGLGAE RICALGDSIE HDLLGGSRAG
LATCFVAGGL HAQDFERAGA ANRAAELQRL LALPAAHGAP APAWALPRLQ WS
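
The sequences above are laid out in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute GC content. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGCGCT GGGCCGCAGG CTTGGCGGAA CTGGCCCCGC ATTTCGACGG CTTCGTGCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field in the Gene Information section.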