Gene Haur_2333 details

Gene Information

Locus tag: Haur_2333
Symbol:
ID: 5734205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2978663
End bp: 2980141
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 52%
IMG OID: 641279474
Product: ferredoxin--nitrite reductase
Protein accession: YP_001545101
Protein GI: 159898854
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0155] Sulfite reductase, beta subunit (hemoprotein)
TIGRFAM ID:
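
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2978663, 2980141)
print(length, protein_length(length))  # 1479 492
```

Under these assumptions the result agrees with the listed Gene Length (1479 bp) and Protein Length (492 aa).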


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGTTC AAATTGAAAC CATCAAAAAG GTCAAAAATG GGCTTGATGT CTTGCCTGAT 
CTGTATCGCT ATGCTCGCTT AGGCTTCGAT GCGATTCCCG AAGATGAGCT TGAGCGCTTG
AAATGGTATG GATTGTTACA CCGCAAGCAA ACGCCTGGCT TCTTTATGCA ACGCCTGCGC
ATTCCCAATG GCATTCTGAG CACGCGCCAA ATGCGGGCAA TCGTCAGCAT CTCGCGCGAT
TTTGGCCGCA ACACCATGGA TTTGACCACC CGCGAAAATA TTCAATTGCG CTGGTTGCGC
ATCGAAGATG TGCCAGAAGT GTTTCAACGC TTGCAAAATG TTGGCCTAAC ATCCCAACAA
ACTGGGCTTG ATAACTATCG CAATGTGATG GGCTGCCCCT TGGCTGGCCT GCACCACGCA
GAAATTTTCA ACGCTGCCCC CATTGCCCAA AGCGTTTCGT TGGCCTTGCT TGGCCGCGAG
TTCAGCGATT TGCCGCGCAA ATTCAATATT ACGATCAGCG GTTGCTCGCA CGATTGTGCC
CATAGCCGAG CCAACGATAT TGGCATGACT CCTGCCGCCA AAGAAATTAA TGGCTATCGT
GTGCTCGGTT TCCACGTAGC GCTGGGCGGA GCATTGGGCG GCACATCGCC GCAACTGGGC
CAAGATGCAG GCATCTTCTT AACCACTGAA CAAGCCTTGC CTTTCTGTCG CGCGGTCTTG
ACGGTGTTCC GCGACAATGG CTCACGCGAA AAACGCACCG AAGCCCGCTT GAAATGGCTG
ATTCGCGAAT GGGGCATGCC GCGCTTTATG GCCGAAGTTG AAAAGGTGTT TGGTCAAGCC
TTCTTCAGCG CTGGCGAATC GTTGTTGATT GAGCATAGCG GCGACCACTT GGGCATTCAT
CAACAACAAG AGGCTGGCTT TGTAACGGTT GGTTTATTAG TGCCAGTTGG TCGAACTAAC
GCCGAGCAAA TGGTCGAAAT CGCTGATTTG GCCGATGCCT ATGGCACTGG CGAACTGCGC
TTGACCCCCG ATCAAAACAT TCTGATTCCG AATGTGCACG AAACCTGCCT CGAACGCTTG
TTGGCTGAGC CATTGCTACA AGTGCTGCAA CCACATGCAC CTGGGGCGTT GCGCGGTTTG
GTCAGTTGTA CAGGTCGCGA TTATTGCCAC TTTGCCTTGA GCGATACCAA AGAATTATCG
TTGCAAGTTG CCACCGAATT GGCCAGCTTA ATTCCTGCTG AACGCCGCGT CGATTTGAAA
GTTTCGGGCT GCGTACACGC CTGTGGTCAG CATCACGTTG GCGAAATTGG TTTGCAAGCT
CAGCGTTTGC GGCTCGAAGA TGGCACAATC GTCGATGCTT TCGATCTATT TGTTGGTGGC
AACCATCAGC AACTGGCAAC CCTTAAAGCT CGCAAAATCC CAGTTGATCA ATTGGCTGGG
CGGATTGCTG CTGAACTCGC CGTTATGGAT GGGGAATAA
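
The gene sequence is printed in blocks of ten bases, so the blocks must be joined before any computation such as a GC content estimate. A minimal sketch in Python follows. The helper names are hypothetical, and only the first line of the listing is used as sample input, so the printed fraction describes that line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed above, used here only as sample input.
first_line = "ATGGCTGTTC AAATTGAAAC CATCAAAAAG GTCAAAAATG GGCTTGATGT CTTGCCTGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing through the same two functions would give a value to compare against the GC content field in the Gene Information section.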
 
Protein sequence
MAVQIETIKK VKNGLDVLPD LYRYARLGFD AIPEDELERL KWYGLLHRKQ TPGFFMQRLR 
IPNGILSTRQ MRAIVSISRD FGRNTMDLTT RENIQLRWLR IEDVPEVFQR LQNVGLTSQQ
TGLDNYRNVM GCPLAGLHHA EIFNAAPIAQ SVSLALLGRE FSDLPRKFNI TISGCSHDCA
HSRANDIGMT PAAKEINGYR VLGFHVALGG ALGGTSPQLG QDAGIFLTTE QALPFCRAVL
TVFRDNGSRE KRTEARLKWL IREWGMPRFM AEVEKVFGQA FFSAGESLLI EHSGDHLGIH
QQQEAGFVTV GLLVPVGRTN AEQMVEIADL ADAYGTGELR LTPDQNILIP NVHETCLERL
LAEPLLQVLQ PHAPGALRGL VSCTGRDYCH FALSDTKELS LQVATELASL IPAERRVDLK
VSGCVHACGQ HHVGEIGLQA QRLRLEDGTI VDAFDLFVGG NHQQLATLKA RKIPVDQLAG
RIAAELAVMD GE
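
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. A minimal sketch in Python is given below. It assumes the sequence is the coding strand read 5' to 3' in frame, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI TCAG order. Translation table 11 uses the same
# codon-to-amino-acid mapping as the standard code; the two differ only in
# which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTGTTC AAATTGAAAC CATCAAAAAG"))  # MAVQIETIKK
print(translate("GGGGAATAA"))  # GE
```

The two sample inputs are the first thirty and the last nine bases of the gene sequence; their translations match the first ten and the last two residues of the protein sequence.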