Gene Haur_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2024 
Symbol 
ID5733913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2516191 
End bp2517573 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content51% 
IMG OID641279168 
Productxenobiotic compound monooxygenase A subunit 
Protein accessionYP_001544795 
Protein GI159898548 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGC AACGACAGAT GAAACTTGGG GCTTTTCTGC CAGCACCTGG CCACCATGTT 
GCCGCGTGGC GACACCCGAA CACTCCAGCT AATGCTGGGC TTGAAATTCA ACACTATACC
CAGGTGGCTC AAACCGCTGA ACGTGGCAAA TTTGATATGC TTTTCCTCTC GGATGGAGTT
GGCATCCGCA CCCATTATAA AGATGAAGAT GAATTAAGCC GTTGGGGTCG GATTGTTCAG
TTTGAGCCAC TGACCTTACT TTCGGCCTTA GCCATGGTTA CCCAAAAGAT TGGTTTGACG
GCAACTGCTT CAACCACCTA TAACGAGCCA TTTCATATTG CCCGCAAATT TGCTTCGCTC
GATTTTCTGA GCAATGGGCG AGCTGGGTGG AATGTTGTGA CCTCGGTGAC CGATGTTGAG
GCCCAAAATT TCAACCTTCA ACACCAACCT GATCATGCCA CCCGTTATCG GCGGGCACGC
GAATTTATGG ATGTGGTAAC AGGATTGTGG GATAGTTGGG AGGATGATGC CTTTATCTTC
GACAAAGCCA CAGGCCGCTA TTTCGAACCA CAAAAACTAC ATATGTTGCA CCATCGTGGC
GAATTTTTTC AGGTACGCGG GCCGCTGAAC CTTGCTCGTT CGCCCCAAGG CTACCCAGTT
ATTGTGCAGG CTGGCTCATC AGAAGACGGT CAAGATTTTG CGGCTCAATG GGCCGAAGTG
ATTTTTACCG CCCATCAAAC GCTTGAGCAA GCCCAAACAT TTTATCGTGG CATCAAAGGC
CAAATGATTA AGCATGGACG CTCGCCTGAA CAAGCCAAGG TTATGCCTGG AGTGTTTGCA
GTGGTTGGGC AAACCAGAGC CGAGGCCGAA GCCAAATATG CAATCTTACA AGAACTGGTT
GATCCGGTGG TTGGTTTAGG GCTATTGACC GGATTGTTGG GTGATGTTGA TATTTCAGGC
TATCCCTTGG ATGGGCCATT GCCAGAATTA CCGGAAACCC AAGGCAGCAC CAGCCGCCAA
AAACTCGTCT ACGAGCAAGC CCAACGCCAA GGCCTCACGA TTCGCCAATT GTATCTCTCG
GTTGCAGGCG GGCGAGGCCA TCGCTTTATT CTTGGAACCC CCAGCGAGAT CGCCAATCAA
CTTGAGGATT GGTTTGTGAA CGAGGCTGCT GATGGCTTTA ACATCATGCC GCCAAGCTTA
CCTGATGGCT TAAACGACTT TGTTGATTTG GTGATTCCTG AATTACAACG CCGTGGATTG
TTTCGAACTG ACTACGAAGG CACAACCTTA CGTGACCATC TAGGGCTTGA TCGCCCGCTC
AATCGCCCGA ACAAGAGTAC TGCCGAACGT GCCACGCTGG CGATTGCCCG AGGTGCTGAA
TGA
 
Protein sequence
MKPQRQMKLG AFLPAPGHHV AAWRHPNTPA NAGLEIQHYT QVAQTAERGK FDMLFLSDGV 
GIRTHYKDED ELSRWGRIVQ FEPLTLLSAL AMVTQKIGLT ATASTTYNEP FHIARKFASL
DFLSNGRAGW NVVTSVTDVE AQNFNLQHQP DHATRYRRAR EFMDVVTGLW DSWEDDAFIF
DKATGRYFEP QKLHMLHHRG EFFQVRGPLN LARSPQGYPV IVQAGSSEDG QDFAAQWAEV
IFTAHQTLEQ AQTFYRGIKG QMIKHGRSPE QAKVMPGVFA VVGQTRAEAE AKYAILQELV
DPVVGLGLLT GLLGDVDISG YPLDGPLPEL PETQGSTSRQ KLVYEQAQRQ GLTIRQLYLS
VAGGRGHRFI LGTPSEIANQ LEDWFVNEAA DGFNIMPPSL PDGLNDFVDL VIPELQRRGL
FRTDYEGTTL RDHLGLDRPL NRPNKSTAER ATLAIARGAE