Gene Veis_4500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4500 
Symbol 
ID4693951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4980427 
End bp4981917 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content67% 
IMG OID639852246 
Productaldehyde dehydrogenase 
Protein accessionYP_999218 
Protein GI121611411 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.947339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCT CCCTCGACGC ACTGATCACC GAATTTCGCA ACACCGGCAC ACTCAGCGGC 
TTGCCGTGCG CGCAATTCAT CGACGGCGCT TATCAGGCTG GCGCGACCAG CGCTGCGCTG
GAGAGCTTCG ACCCCGGCAG TGGCCGTGTC TTTGCGCAAT TTGCCGCCGG CGGCGAGGTC
GAGGTCGATG CGGCCGTGCA CGCTGCAAAG CGCGCCTTGC CTGCCTGGAG CGAGACCGCG
CCCGCCGAGC GCAGCCGCGT GCTCTGGGCC ATTGCGCAGC AGGTGCGCCA ACATGCCGAC
CGGCTGGCGC TCATCGAGTG CCTGGATAGC GGCAAGCGGC TTGGCGAGGC CCAGGGCGAT
GTGCGCGGCG TGATACGCGC CTTCGAGTAT TACGCGGGCG CGGTAGACAA GATGCAGGGG
GACAGCTTTC CGCTCGGCAA GGACTACATC GGCTTTACCG TCGAGGAGCC GATCGGCGTT
GCGGCGCAGA TCATCCCTTG GAATTACCCG GTCGGCACCG CCGCGCGCGG CATCGCTCCG
GCACTGGCCG CCGGTTGCAC CGTGGTTGCC AAACCCGCAG AGCAAACCCC CTTGTCGGCG
TTGCTGTTGG CCGAACTGGC CAGCGCCGCC GGCCTGCCTG CGGGCGTGCT CAACGTCGTC
ACCGGCACTG GCGCGGCGGC GGGCGCAGCG CTGGTAGCGC ACCCGGGCAT TGGCCACATC
ACCTTCACCG GTTCGGTCGC CACCGGGCAA CGGGTCATGC GCGCCGCCGC CGACAACGTG
ACGCGCGTGC TGCTCGAACT GGGCGGCAAA TCACCGCTGG TCGTTCTGGC CGACTGCGAT
GTCGAGCAGG CGCTCGATGG CGTACTCGGC GCCATCTACG AGAACGCCGG GCAGATCTGC
TCGGCAGGCT CGCGCCTCAT CATCGAACGC AAACTGCATG GCAGCTTCAT GGCGCGGCTG
CTGGAGCGGG TCGGCAAACT GGGGCTGGGT CACGGCCTGG ACCAGCCGGA TGTCGGCCCG
GTCAATTCGC TGCTGCATCT GAACCGCATA CACGAGCATG TCGCCACCGC CGCCGCGCGC
GGCAATGCGG TGCTTGCGGG CGGCAGCATC GCCCAGCCCG CGAACGCGCC GGGCGGATGG
TTTTATCGCC CCACGGTGAT CGAGGCACGG TCGGCGCAAG ACCCGGTGGT GCAGGAAGAA
ATCTTCGGCC CGGTGCTGGT GGTGCAGCAA GCCGACGATC TGGAACAAGC CATCGCACTG
GCCAACGGCA CCGACTACGC GCTGGTGGCA GGCATTTACA CCAGAGACAT CACGCAGGCT
TACCGCTTCG CCCGCCGTGT CGATGCCGGG CAGGTCTACA TCAACGAATA CTTCGCCGGC
GGCATCGAAG TCCCGTTCGG AGGCAACCGC AAATCCGGCT TTGGCCGTGA AAAAGGTCTG
GAAGGAATCA AGAGCTATTG CAAGCTCAAG AGTGTGGTGG CGCGGGTTTA G
 
Protein sequence
MNPSLDALIT EFRNTGTLSG LPCAQFIDGA YQAGATSAAL ESFDPGSGRV FAQFAAGGEV 
EVDAAVHAAK RALPAWSETA PAERSRVLWA IAQQVRQHAD RLALIECLDS GKRLGEAQGD
VRGVIRAFEY YAGAVDKMQG DSFPLGKDYI GFTVEEPIGV AAQIIPWNYP VGTAARGIAP
ALAAGCTVVA KPAEQTPLSA LLLAELASAA GLPAGVLNVV TGTGAAAGAA LVAHPGIGHI
TFTGSVATGQ RVMRAAADNV TRVLLELGGK SPLVVLADCD VEQALDGVLG AIYENAGQIC
SAGSRLIIER KLHGSFMARL LERVGKLGLG HGLDQPDVGP VNSLLHLNRI HEHVATAAAR
GNAVLAGGSI AQPANAPGGW FYRPTVIEAR SAQDPVVQEE IFGPVLVVQQ ADDLEQAIAL
ANGTDYALVA GIYTRDITQA YRFARRVDAG QVYINEYFAG GIEVPFGGNR KSGFGREKGL
EGIKSYCKLK SVVARV