Gene Haur_2522 details


Gene Information

Locus tag: Haur_2522
Symbol:
ID: 5734400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3222114
End bp: 3225275
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 50%
IMG OID: 641279662
Product: cytochrome P450
Protein accession: YP_001545288
Protein GI: 159899041
COG category: [P] Inorganic ion transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0369] Sulfite reductase, alpha subunit (flavoprotein); [COG2124] Cytochrome P450
TIGRFAM ID:
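
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
START_BP = 3222114
END_BP = 3225275

length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                    # 3162 bp
print(protein_length(length_bp), "aa")    # 1053 aa
```

Both results match the Gene Length and Protein Length fields listed in the record.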


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATTT CCAGCCCAAT TCGGTATATT CCCCAACCAC CAACCCGCCC AATCGTCGGC 
AATGTGCCCG ATATTGGCAT GGAAACGCCT GTCCAGAATT TGATGAAATT AGCTCAGCAT
TATGGGCCAA TTTTCCGCGT GAGTTTTCCT AATCGCAGCG TGTTGGTCGT TTCTTCGGCT
GAACTTGTGG CCGAAATTAG CGATCAACAA CGTTTCGATA AATTATTGCA TGGGCCATTA
ATTCAAATTC GCGATTTTGC AGGCGATGGC TTGTTTACGG CCTATACCGA GGAAGCCAAT
TGGAGCAAAG CCCATCGTTT ATTGATGCCA GCCTTCGGGC CAGCCAGTAT GCGTAATTAT
TTCGACGACA TGCTCGATAT TGCCGACCAA TTATTTACTA AATGGGAGCG TCAAGGGCCA
GAAACCGATT TTGATGTGGC CGATAATATG ACGCGCCTAA CGCTCGATAC GATTGCCTTA
TGTGGTTTTG GCTATCGGTT TAATTCGTTT TATCAACGCG AAATGCACCC ATTTGTTGAA
GCAATGGTGC GAGCCTTAGC CGAGGCTGGG GCACGCGCCC GCCGTTTATC AATTCAAACG
AAATTAATGC GCTCAACCCA GCGTCAATAT GAAGCTGATA TGCAGTATAT GCACGGCATC
ACCGATGAAT TAATTGCTAA ACGGCGCAGT TTGCCAAGCA ACGAAGTTCC CAACGATCTG
CTAGGATTAA TGCTCAATGC CAAAGATTCG ATCACCGGTG AAGGCTTAGA TGATGCCAAT
ATTCGCAATC AACTGGTGAC ATTTTTGATT GCTGGCCACG AAACCACCAG CGGCCTGCTC
TCGTTTGCAA CCTACTTTTT GCTCCAACAG CCTGAAATTT TGCAACGCGC TCAAGCCATC
GTCGATCAAG TGCTCGGCGA TCGGCTGCCA CGCTACGAAG ACTTGGCCAA ACTGGGCTAC
CTCGACCAAA TTTTGCGCGA AACCTTGCGG CTCTGGCCAA CCGCGCCTGT TTTTGGGGTT
TATGCCAAGC ACGATACTAA CATTGGTGGC TTTCCGATTA AGCAGGGCGA AAAATTCATA
GCCTTATTGC CAACTTTGCA CCGCGATCCC AAAGTTTGGC TCAACCCCAA CCAATTTGAT
CCCGATCGCT TTGCGCCTGA AGTGAGGGAA CAAATCCCTG AGCACGCTTG GAAGCCATTT
GGCAATGGCC AACGCGCCTG TATTGGGCGT TCATTTGCCA TGCAAGAGGC CAGCTTGGTT
TTAGCAATGA TGCTGCAACG TTTTGAATTA TCGCAACCGC AACCCTACCA GTTGCATGTC
AAAGAAACCC TAACGCTCAA ACCTGAAGGC TTGACCGTTC GAGCACGGGT ACGCAAAAAC
ATCGTGCGCA GCACCAAGCC AACTCAGCCA AATGTAGCAA TTCAATCAAA CCCAAATCAA
GCCCAACACA ATATCCCATT GCTGGTGCTG TATGGCTCTA ATTCTGGCTC ATCTGAAGCC
TTCGCTCGCC GAATTGCCAG CGATGGTGAG GCACGCGGTT ACCAAACAAG CGTGGCTGCG
CTCAATAATT ATGTCAATAA ATTACCAACC ACCGGAGCAG TGAGCATCGT GGCGGCCTCA
TACAACGGCC AGCCTGCCGA TAATGCCCAA GCCTTCTGTC AATGGTTAGC TGGCGTTGAG
CCAAACTCGC TCAAAGGCGT GCGCTATAGC GTTTTTGGCT GTGGCAACCG CGATTGGCAG
AGCACCTACC AAGCTGTGCC GACTCAAATT GATCAACACT TGCAGGCCGC CGGAGCCGAA
CGTTTGCTTC AACGCGGTGC AGCCGATGCC CGCAGCGATT TCTTTGGTGA TTTTGAGCGT
TGGTATGCGC CGTTTTGGCA AACCCACAAC CAAACATTTG CAATCGCAAG CGCCGAAATT
AACAGCAAAC CACTGTACAA GGTCGAATTA CTGCCATCAA GCAGCGATCA GTTGGCCCAA
CAAACGGGCT TTATGTTTGC TAGCGTGCTC GAAAATCGCG AATTAGTTGA TCTCAGCTCG
CCTTTGGGTC GTTCAAAACG CCACATCGAA TTACGTTTGC CAAACGAACT GCAATACCAA
GCTGGTGATT ATTTAGCGAT CTTGCCGCAA AATCATCCTA GCCTAATCGA GCGGGCTTGC
AAACATTTTG GGCTAAAACC TGAACAAACT ATAATTTTGC ATGCTACACG CGGGGCTGCC
AACCTGCCAA TTGATCGCCC GATTAGCCTA GGTGAATTGC TGAGCAGCCA CGTTGAATTA
GCAACTCCCG CCACGCAGCG CGATTTGGAG TTGTTGGCGC AGAAGAATGT TTGTCCGTCA
CACCAAATTC ATTTAGCTGC ACTGGCCGCA GATCACGAAC GCTATACCAC CGAGATTTTG
CAAAAACGCC TGAGCTTGTT GGATATGCTT GAGCAATATC CATCCTCAGT GCTTGATTTT
GGCGAATTTT TAGAGCTATT GCCAGCAATG CGAGTGCGCC AATATTCAAT TTCATCGTCA
TCATTAGTCA ATCCAAACCA AGCCAGCCTA ACCGTGGCGG TGGTTGATGC CCCAGCATGG
TCGGGTAAGG GCCAGTTCTA TGGCACGGGT TCGAGCTATT TGGCCCGTTT GCAAGTTGGC
GATCAGATTG CGGTGAGCTT GCGTCAACCA CATATTCCGT TTCGCCCACC GAGCGCCAAC
AGCACACCAT TACTGATGAT TTGTGCAGGC ACTGGTTTAG CGCCATTCCG TGGTTTTATC
CAAGAGCGCG TCGCTCGCCA AGGCCAAGGC GAAGCACTTG GCCCGAATGC CCTGTTTTTT
GGCTGCGACC ATCCTGAGGT TGATCTGCTT TATCACGAAC AGATTCAAGC TTGGCAAAAA
GCTGGAGTGC TAGAATTTTT CCCAGCATTC TATCGCCAGC CAGTTGGTGA AGTCAGCTTT
GTGCAACATC GGCTCTGGCA AGAACGCCAG TATGTGTGGA GCTTAATCGA ACAAGGTGCA
GTAATAGCCG TTTGTGGCGA CGGTCGCTCC ATGGCTCCAG CTGTGCGTGA AACCTTGGCG
CGAATCTATG CCGAAGCAAC TGGCAGCGAG CAAACAGCAG GCATGGCATG GATTGCCGAA
ATCGAGCAAG CAGGACGCTA TGTCGCCGAT GTTTTCGGCT AA
 
Protein sequence
MSISSPIRYI PQPPTRPIVG NVPDIGMETP VQNLMKLAQH YGPIFRVSFP NRSVLVVSSA 
ELVAEISDQQ RFDKLLHGPL IQIRDFAGDG LFTAYTEEAN WSKAHRLLMP AFGPASMRNY
FDDMLDIADQ LFTKWERQGP ETDFDVADNM TRLTLDTIAL CGFGYRFNSF YQREMHPFVE
AMVRALAEAG ARARRLSIQT KLMRSTQRQY EADMQYMHGI TDELIAKRRS LPSNEVPNDL
LGLMLNAKDS ITGEGLDDAN IRNQLVTFLI AGHETTSGLL SFATYFLLQQ PEILQRAQAI
VDQVLGDRLP RYEDLAKLGY LDQILRETLR LWPTAPVFGV YAKHDTNIGG FPIKQGEKFI
ALLPTLHRDP KVWLNPNQFD PDRFAPEVRE QIPEHAWKPF GNGQRACIGR SFAMQEASLV
LAMMLQRFEL SQPQPYQLHV KETLTLKPEG LTVRARVRKN IVRSTKPTQP NVAIQSNPNQ
AQHNIPLLVL YGSNSGSSEA FARRIASDGE ARGYQTSVAA LNNYVNKLPT TGAVSIVAAS
YNGQPADNAQ AFCQWLAGVE PNSLKGVRYS VFGCGNRDWQ STYQAVPTQI DQHLQAAGAE
RLLQRGAADA RSDFFGDFER WYAPFWQTHN QTFAIASAEI NSKPLYKVEL LPSSSDQLAQ
QTGFMFASVL ENRELVDLSS PLGRSKRHIE LRLPNELQYQ AGDYLAILPQ NHPSLIERAC
KHFGLKPEQT IILHATRGAA NLPIDRPISL GELLSSHVEL ATPATQRDLE LLAQKNVCPS
HQIHLAALAA DHERYTTEIL QKRLSLLDML EQYPSSVLDF GEFLELLPAM RVRQYSISSS
SLVNPNQASL TVAVVDAPAW SGKGQFYGTG SSYLARLQVG DQIAVSLRQP HIPFRPPSAN
STPLLMICAG TGLAPFRGFI QERVARQGQG EALGPNALFF GCDHPEVDLL YHEQIQAWQK
AGVLEFFPAF YRQPVGEVSF VQHRLWQERQ YVWSLIEQGA VIAVCGDGRS MAPAVRETLA
RIYAEATGSE QTAGMAWIAE IEQAGRYVAD VFG
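
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not part of the original record, of how the gene sequence might be cleaned, translated, and checked for GC content. The helper names are hypothetical. The codon table is the standard one, whose amino-acid assignments translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGTATTT CCAGCCCAAT TCGGTATATT"
print(translate(clean(first_blocks)))  # MSISSPIRYI
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean` and `translate` should reproduce the whole listing in the same way, and `gc_fraction` can be compared against the GC content field.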