Gene Haur_4024 details

Gene Information

Locus tag: Haur_4024
Symbol:
ID: 5735885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5135588
End bp: 5137231
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 53%
IMG OID: 641281174
Product: FAD dependent oxidoreductase
Protein accession: YP_001546784
Protein GI: 159900537
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02734] phytoene desaturase
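
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5135588
end_bp = 5137231

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon and yields no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1644 547
```

Both results agree with the Gene Length (1644 bp) and Protein Length (547 aa) fields.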


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.411315
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCAAT TTGATGCGAT TGTGGTTGGC GGTGGGCATA ATGGCTTGAC CTGTGCCTGC 
TATTTGCAAA AGGCCGGAAT CAAAACCCTA GTGATCGAAC GACGAGCGAT CGTCGGCGGC
GCAGTTTGCA CCGAAACCAT GTTTGGTGGC TATAAGATGG ATGTTGGCTC ATCGGCCCAC
ATTATGATTC ACCTGACTCC TGTAGTGCGT GAGCTTGAAC TGCACAAATT TGGGCTTGAA
TATATTGATA TGGACCCATT TGCTTGGTAT CCATTGCCCG ATGGCTCGGG GGCAATTGAA
TTTTGGCGTG ATTTAGACAA GACGTGTGCT TCGATTGAGA AAATTTCACC CAAGGATGCC
CATGCTTATC GCCAATTTGT GGCGTTGTGG GGGCCGCTCA ATGAAGGGGT TTTCGATGTA
TTTCTCAAAG CACCTTCGCC TGCCAACTTA GGCCGCCAAA TGCTAACGGG CCAATTCAAA
GGCGAAAAAG GCACGCATCC GCTGGATATT CTGCGGCGCT TGTTTACCTC GTATGGACAT
TTGATCAACG AAACCTTTGA GAGCGAAGCA ATGCGGGCAG CAATGGGATG GCTAGCAGCG
CAATCTGGCC CACCACCGCA CGAAATTGGC ACCGGCGATT TTGCGGGCTG GCACGCGATG
CTGCATGAAA GTGGCGCGAA ACATCCGCGT GGTGGTTCGG GCATGTTGAC CCAAGCCATG
GCGGCACGCT TCAAAAGTGA TGGCGGTACG CTGCTGCTTG ATGCCCCAGT TGAACGGATT
GTGGTGCAAA ACGGCGTAGT GCACGGCGTA CAATTAACCT CGGGCGAAAC TTACACCGCA
CCAACGGTTA TTTCCAATGC CCATGTGCAA ACCACCTTAT TAAAACTGGT TGAGCCTGAG
CAACTGCCAA ATGGTCTGGT CGAGCGGGTT GGCCGCATTC GCGTTGGCAA TGGCTTTGGG
ATGGCGGTGC GCTGTGCTGC CGATGAATTG CCCGATTATC TGGCTGCGCC TTCTGGTGGT
CGCCCGCATC CTTCACATCA TGGGTTGCAA TTGCTTTGCC CTTCGATCGA CTACCTGAAT
CGCGCGGTTA GCGATTATGA TCGCGGCGTG CCAGCGACCG ATCCAGCGGT AATTGCCATG
ACATTTAGTG CAATCGACCC CGATGTTGCA CCCAAGGGCA AGCATACGCT GTTTTTGTGG
GGTCAATATC ATCCGTATCA ATTAAGTAAT GGCGAAGATT GGGATAGCAT TGCCGAGCGC
GAGGCCGACA AATTACTCGA AGTCGTGTAT CGTTATGCCC CCAATATGCG TGGCAAAATT
AGCAACCGCT ATGTGCAAAC TCCCTTAACC TTGGAGCGCA CCTTTGGCAT GTTGCGTGGT
AATGTGATGC ATGTCGAAAT GTCGTTCGAT CAGATGTTTG CCTTCCGCCC GCTGCCTGAG
CTTTCCGAAT ACCGCGTGGC GGGAATTAAG GGCTTATATT TGACCGGAGC CAGCACCCAT
CCTGGTGGCG GCGTATTTGC GGCCTCAGGT TACAACACCG CCCAAACCGT GCTCAAAGAT
CAGCAGCCAT CACGTCAATG GGTTGGCTGG ACGCTCGGGG CGGCTGCCGC TTTAGGTGTG
GTGGCTTGGG CCAAGAAGAA GTAA
 
Protein sequence
MAQFDAIVVG GGHNGLTCAC YLQKAGIKTL VIERRAIVGG AVCTETMFGG YKMDVGSSAH 
IMIHLTPVVR ELELHKFGLE YIDMDPFAWY PLPDGSGAIE FWRDLDKTCA SIEKISPKDA
HAYRQFVALW GPLNEGVFDV FLKAPSPANL GRQMLTGQFK GEKGTHPLDI LRRLFTSYGH
LINETFESEA MRAAMGWLAA QSGPPPHEIG TGDFAGWHAM LHESGAKHPR GGSGMLTQAM
AARFKSDGGT LLLDAPVERI VVQNGVVHGV QLTSGETYTA PTVISNAHVQ TTLLKLVEPE
QLPNGLVERV GRIRVGNGFG MAVRCAADEL PDYLAAPSGG RPHPSHHGLQ LLCPSIDYLN
RAVSDYDRGV PATDPAVIAM TFSAIDPDVA PKGKHTLFLW GQYHPYQLSN GEDWDSIAER
EADKLLEVVY RYAPNMRGKI SNRYVQTPLT LERTFGMLRG NVMHVEMSFD QMFAFRPLPE
LSEYRVAGIK GLYLTGASTH PGGGVFAASG YNTAQTVLKD QQPSRQWVGW TLGAAAALGV
VAWAKKK
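
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the pipeline used to produce this record. It builds the codon-to-residue mapping of the standard genetic code, whose elongation codon assignments are the same in translation table 11 (table 11 differs in which start codons it allows). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11
# for elongation codons. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence listed above.
first = translate("ATGGCGCAAT TTGATGCGAT TGTGGTTGGC")
last = translate("GTGGCTTGGG CCAAGAAGAA GTAA")
print(first, last)  # MAQFDAIVVG VAWAKKK
```

The two fragments match the first ten and last seven residues of the protein sequence. The final 24 bases are in frame because the 1620 bases before them are a multiple of three.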