Gene Haur_4391 details

Gene Information

Locus tag: Haur_4391
Symbol:
ID: 5736241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5609853
End bp: 5611013
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 50%
IMG OID: 641281553
Product: acyl-CoA dehydrogenase domain-containing protein
Protein accession: YP_001547151
Protein GI: 159900904
COG category: [I] Lipid transport and metabolism
COG ID: [COG1960] Acyl-CoA dehydrogenases
TIGRFAM ID:
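
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 5609853
end_bp = 5611013

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.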


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTCA ACGAAGAGCT ACGCATGTTG CGTGAAACCG TGCGCGATTT TGCCGTCAAA 
GAAATTAAGC CAATTGCCCG CGAAGTTGAT GAAGCAGAAC GCGTACCATT CGAGACAATT
CGCAAGGCTG GTAATTTGGG CTTGCTCGGC ATTCCCTTCC CCGAGGAATA CGGCGGCGCT
GATGCAGGGA TCGTCGGCTA TTGTATTTTG CAAGAGGAAA TCAACCGTTA TTGTGCCTCA
ACTGGCACGA TTATTGGCGC ACATGCCCAG CTTTGTGGCA TGTCGATTTT TCTATCGGGC
AATGATGAGC AAAAATCACG CTATCTTAGC AAACTCAACG AAGGCAAATT ATTGGGTGCT
TGGGCACTCA CCGAGCCAAA TGCCGGCTCC GATGCGGCCA GCATTACCAC CACCGCAACC
CAAGATGGCG ACGATTGGAT CATCAACGGC GGCAAGATGT GGATCACCAA TGGCTCGTTT
GCCGATGTGA TTGTGGTGTT TGCCTCGACC AATCGCGAGC GCGGCGCACG CGGCGGCATC
AGCGCATTTA TCGTCGAAAA AGATTTCGCG GGCTTCAAAG TTGGCAAAGT CGAGGAAAAA
ATGGGCTTGC GAGCCTCGCA TACCGCCTCG ATCTTTTTTG AAAATTGCCG TGTGCCCGCT
AAAAATGTAT TGGGCGAAAT TGGAGCAGGC TTTGGCGTTG CCATGAAAAC CCTCGATATT
GGGCGCTGTG GTTTGGGCGC AAGTGCGCTT GGCTCAGCCA AAGAAGCCTA CGATTTGGCC
TTGCACTACA GCGTTGAACG CCAACAATTT GGCCGCCCAA TCGCCGATTT CCAAGCAATT
CAGATCAAAT TGGCCGAAAT GCGCACTAAA ATCTATGCTA TGGAGCAAAT GGTTTATCAC
TGCGCAGCCT TGGTTGATGC CAAAAAGCCA GCCACGCTTG AATCATCGAT GGTCAAGTTG
TTCTGCACCG AGGCTGCTTC GCAAATTATC GACGAAGCGA TTCAAATTTA TGGTGGAATG
GGCTTTAGCC GCGAAGTACC ACTCGAACGG ATGTATCGCG ATGCTCGGGT CACGCGGATT
TTTGAAGGCA CAAACGAAAT TCAAAAATAT GTGATTGCTG GCGAAGAACT CAAAAAACTG
GGCCATAAAA TCAAAATGTA G
 
Protein sequence
MELNEELRML RETVRDFAVK EIKPIAREVD EAERVPFETI RKAGNLGLLG IPFPEEYGGA 
DAGIVGYCIL QEEINRYCAS TGTIIGAHAQ LCGMSIFLSG NDEQKSRYLS KLNEGKLLGA
WALTEPNAGS DAASITTTAT QDGDDWIING GKMWITNGSF ADVIVVFAST NRERGARGGI
SAFIVEKDFA GFKVGKVEEK MGLRASHTAS IFFENCRVPA KNVLGEIGAG FGVAMKTLDI
GRCGLGASAL GSAKEAYDLA LHYSVERQQF GRPIADFQAI QIKLAEMRTK IYAMEQMVYH
CAALVDAKKP ATLESSMVKL FCTEAASQII DEAIQIYGGM GFSREVPLER MYRDARVTRI
FEGTNEIQKY VIAGEELKKL GHKIKM
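
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the database's own method: the helper names `clean` and `translate` are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = "ATGGAACTCA ACGAAGAGCT ACGCATGTTG CGTGAAACCG TGCGCGATTT TGCCGTCAAA"
print(translate(first_line))  # MELNEELRMLRETVRDFAVK
```

The output corresponds to the first 20 residues of the protein sequence. Passing the full gene sequence through the same function should reproduce the protein listed in the record, provided the assumptions above hold.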