Gene VIBHAR_02420 details


Gene Information

Locus tag: VIBHAR_02420
Symbol:
ID: 5554362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio harveyi ATCC BAA-1116
Kingdom: Bacteria
Replicon accession: NC_009783
Strand:
Start bp: 2431616
End bp: 2433136
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 49%
IMG OID: 640907906
Product: NAD-dependent aldehyde dehydrogenase
Protein accession: YP_001445609
Protein GI: 156974702
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
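
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2431616
end_bp = 2433136

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the reported gene length (1521 bp) and protein length (506 aa).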


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTATG CACAGCCAGG TAGTGAAAAT GCCATCGTTA ATTTTAAATC GCATTACGAC 
AACTACATCG GTGGCGAATG GGTAAAACCA ACGGGTGGTG AATACTTTGA CAATACATCG
CCAATCAATG GCCAAGCGTA TTGTAAGGTG GCTCGTTCTG GAGAAGCAGA CATCAATCTT
GCACTGGATG CTGCACATGG CGTAAGAGCG CAATGGGCGA AAACCAGTGT GACTGAACGC
TCGAACATTC TACTGAAGAT TGCCGATCGC ATTGAACAAC ATCTTGAAGA GCTGGCTGTG
GCAGAAACAT GGGAAAACGG CAAACCCGTT CGTGAAACGT TAGCCGCAGA CTTGCCTTTG
GTCGTCGACC ATTTTCGTTA CTTCGCAGGT TGTATCCGTG CTCAAGAAGG CAGTGCGGCC
GAGTTAGATG AGCACACCGC AAGTTATCAC TTCCCTGAGC CGATTGGGGT TGTTGGGCAG
ATCATCCCTT GGAACTTCCC TATGTTGATG GCGGCTTGGA AACTTGCTCC TGCATTGGCT
GCGGGCTGTT GTGTTGTGCT AAAACCTGCC GAACAAACGC CAACTTCGAT TCTCGTTCTG
ATGGAAACAA TTGGTGACCT TCTGCCTGCT GGCGTGGTGA ATGTAGTCAA TGGCTTTGGC
TCAGAAGCAG GGCAAGCACT GGCGACGAGT AACCGAATCG CTAAGTTGGC ATTTACGGGT
TCAACAGAAG TCGGTAACCA TATTTTGAAA TGCGCCGCAG AGAACCTTAT TCCATCAACG
GTTGAGCTTG GTGGTAAATC TCCAAATATC TACTTCCCAG ATGTGTTTGA CCACGAAGAC
GAATACCTAG ATAAGTGCAT TGAAGGCACT TTACTTGCGT TCTTCAACCA AGGGGAAGTG
TGTACCTGTC CATCACGTGT GTTAGTGCAT GAGTCTGTGT ACGACAAGTT TATTGCTAAA
GTCGCAGAGC GCGCGCAGAC AATCAAGCAG GGTAATCCAC TCGATACAGA TACTCAGGTT
GGCGCTCAAG CCTCGCAAGA GCAGTTCGAT AAGATCCTGA GCTACCTAGA AATCGGTCGT
CAAGAAGGCG CGAAAGTCGT GTTTGGTGGC GATGTAGCGA AACAAGAAAA CGACTTGGAG
CAAGGCTACT ATATCCAACC GACTTTGTTA CAAGGTCACA ACAAGATGCG CGTATTCCAA
GAAGAGATCT TTGGCCCAGT CATCGCAGTG ACCACCTTCA AAGATGAAGC GGAAGCACTA
GCGATTGCCA ATGACACGGA ATATGGCTTG GGTGCAGGTG TGTGGACGCG AGACCAAAAC
CTTGCTTATC GCATGGGGCG CAACATTGAA GCAGGGCGCA TTTGGATTAA CTGCTATCAC
GCTTACCCAG CACACGCTGC GTTTGGTGGC TACAAGAAAT CAGGTATCGG CCGTGAGACG
CATAAGATGA TGCTTGATCA CTACCAAAAC ACGAAAAACC TACTCATTAG CTACGACGTC
AATCCGTTAG GCTTCTTCTA A
 
Protein sequence
MIYAQPGSEN AIVNFKSHYD NYIGGEWVKP TGGEYFDNTS PINGQAYCKV ARSGEADINL 
ALDAAHGVRA QWAKTSVTER SNILLKIADR IEQHLEELAV AETWENGKPV RETLAADLPL
VVDHFRYFAG CIRAQEGSAA ELDEHTASYH FPEPIGVVGQ IIPWNFPMLM AAWKLAPALA
AGCCVVLKPA EQTPTSILVL METIGDLLPA GVVNVVNGFG SEAGQALATS NRIAKLAFTG
STEVGNHILK CAAENLIPST VELGGKSPNI YFPDVFDHED EYLDKCIEGT LLAFFNQGEV
CTCPSRVLVH ESVYDKFIAK VAERAQTIKQ GNPLDTDTQV GAQASQEQFD KILSYLEIGR
QEGAKVVFGG DVAKQENDLE QGYYIQPTLL QGHNKMRVFQ EEIFGPVIAV TTFKDEAEAL
AIANDTEYGL GAGVWTRDQN LAYRMGRNIE AGRIWINCYH AYPAHAAFGG YKKSGIGRET
HKMMLDHYQN TKNLLISYDV NPLGFF
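
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It builds the codon-to-amino-acid mapping of translation table 11 from the standard NCBI amino acid string, strips the 10-base grouping used in the record, and translates the first line of the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Only the first 60 bases are used here to keep the example short, so the GC fraction printed is for that fragment, not the whole gene.

```python
from itertools import product

# Codon table for NCBI translation table 11, in the usual TCAG ordering.
# The amino acid assignments are the same as in the standard code;
# table 11 differs in its permitted start codons, which does not matter
# here because this gene begins with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases).
first_line = "ATGATTTATG CACAGCCAGG TAGTGAAAAT GCCATCGTTA ATTTTAAATC GCATTACGAC"
print(translate(first_line))
print(round(gc_fraction(first_line), 3))
```

The translated fragment should match the first 20 residues of the protein sequence listed above (MIYAQPGSEN AIVNFKSHYD). Passing the full gene sequence to the same functions would allow the complete protein and the overall GC content to be compared with the record.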