Gene Haur_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1159 
Symbol 
ID5733052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1330754 
End bp1332274 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content50% 
IMG OID641278299 
Productaldehyde dehydrogenase 
Protein accessionYP_001543935 
Protein GI159897688 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCTACA CCAATCCCAA TCAGCCGGGC AGTAAAGTCA GTTTTAAGTC ACGGTATGGC 
AACTACATCA ATGGCGAGTT TGTTGAGCCA GTCAAGGGCA TGTATTTTGA AAATATCAGC
CCAGTTAATG GCAAGCCGTT CTGCGAAATT CCTCGTTCGA CCGCCGAAGA CATCGAAAAA
GCGCTCGATG CGGCCCATGC TGCCAAAGCT GCTTGGGGTG CAACCTCACC CGCCCAACGC
GCCAATATTC TGAATAAGAT TGCCGATCGC ATGGAAGCCA ACTTGGAGAT GCTGGCGGTC
GCCGAAACAT GGGAAAATGG CAAGCCTGTG CGCGAAACTC TCGCCGCTGA TTTACCCTTG
GCGATCGATC ACTTCCGCTA TTTTGCTGGG GTAATTCGGG CGCAAGAAGG CAGTGCCGCC
ACAATCGACG AAAATACAAT TGCCTACCAT TTTTATGAGC CGCTGGGCGT GGTGGGCCAA
ATTATTCCGT GGAACTTCCC GCTGTTGATG GCAACTTGGA AATTGGCTCC GGCCCTCGCT
GCTGGCAATT GTGTGGTGCT CAAGCCCGCC GAACAAACGC CCAGCACAAT TTTGGTCTTG
ATGGAATTGA TCGGCGATTT GATTCCAGCG GGCGTGGTCA ATGTGGTCAA TGGCTTTGGG
ATTGAAGCTG GCAAGCCGTT GGCGAGCAGC AATCGGATCG CCAAAATAGC CTTTACGGGC
GAAACCACCA CTGGCCGTTT GATTATGCAA TATGCCTCAG AAAATATCAT TCCTGTGACC
TTAGAGCTTG GTGGCAAATC GCCCAACATC TTCTTCGAGG ATGTTTTGAG CAAGCAAGAT
TCGTTTGTTG ATAAAGCGCT CGAAGGCTTC ACCATGTTTG CCTTGAACCA AGGCGAAGTT
TGTACCTGTC CATCACGGGC ATTAATTCAA AAATCGATCT ATGGTGAGTT TTTAGAGCGA
GCGGTCGAAC GCACCAAACG CTGCATTCAG GGCAATCCAC TTGATCCAGC CACAATGGTG
GGGGCACAAG CCTCCAATGA TCAATTCGAG AAAATTTTGT CGTATTTGGC GATTGGCCGC
GACGAAGGGG CTAAAGTGCT GGTTGGCGGA GCCAAAGCTG AGCTTAGCGG CGATTTAGCT
GAAGGCTATT ATGTTCAGCC GACGATCTTT GCTGGCAACA ACCGCATGCG AATCTTCCAA
GAAGAAATTT TCGGGCCAGT CGTTTCGGTG ACTTCGTTCG ATGATTTTGA CGATGCCTTG
AGTATTGCCA ATGATACCTT GTATGGCTTG GGCGCTGGTT TGTGGACTCG CGATATGAAC
ACAGCCTATC GCATGGGTCG GGCAATTCAA GCAGGCCGTG TTTGGACCAA CTGTTACCAC
TTGTATCCAG CCCATGCTGC GTTCGGTGGC TACAAACACT CGGGGATTGG CCGCGAAAAC
CATAAGATGA TGCTCAACCA TTATCAACAA GTTAAAAACC TGTTGGTCAG CTACGATCCC
AATCCAATGG GCTTCTTTTA G
 
Protein sequence
MVYTNPNQPG SKVSFKSRYG NYINGEFVEP VKGMYFENIS PVNGKPFCEI PRSTAEDIEK 
ALDAAHAAKA AWGATSPAQR ANILNKIADR MEANLEMLAV AETWENGKPV RETLAADLPL
AIDHFRYFAG VIRAQEGSAA TIDENTIAYH FYEPLGVVGQ IIPWNFPLLM ATWKLAPALA
AGNCVVLKPA EQTPSTILVL MELIGDLIPA GVVNVVNGFG IEAGKPLASS NRIAKIAFTG
ETTTGRLIMQ YASENIIPVT LELGGKSPNI FFEDVLSKQD SFVDKALEGF TMFALNQGEV
CTCPSRALIQ KSIYGEFLER AVERTKRCIQ GNPLDPATMV GAQASNDQFE KILSYLAIGR
DEGAKVLVGG AKAELSGDLA EGYYVQPTIF AGNNRMRIFQ EEIFGPVVSV TSFDDFDDAL
SIANDTLYGL GAGLWTRDMN TAYRMGRAIQ AGRVWTNCYH LYPAHAAFGG YKHSGIGREN
HKMMLNHYQQ VKNLLVSYDP NPMGFF