Gene Hhal_0273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0273 
Symbol 
ID4711169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp310188 
End bp311654 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content66% 
IMG OID639854733 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001001869 
Protein GI121997082 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.280105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACT TGACGCAGGA AGAGGCCCAG GCGGTGATCG AGGAGGTCCT CGAGGTCTAC 
CCCGAGGCCA CCCGCGCGGA ACGGGACAAG CACCTGGCCG CCGTCGGCCC GGATCAGCAG
CCTAGCAAGT GCGTGACCTC CAACCGCAAG TCGGTCCCCG GGGTGATGAC CCAGCGCGGC
TGCGCCTACG CCGGCTCCAA GGGTGTGGTC TGGGGGCCGA TCAAGGACAT GGTGCACGTC
TCCCACGGCC CGGTGGGTTG TGGTCAGCTC TCCCGGGCCG GCCGGCGCAA CTACTACACC
GGCCATACCG GGGTGGACAC CTTCGGCACC ATCAACTTCA CCTCAGACTT CCAGGAGCGG
GACATCGTCT TTGGTGGCGA CCCGAAGCTG GAGAAGATCG TCGACGAGAT CGAGATGCTC
TTCCCGCTGA ACAAGGGCAT CACCGTGCAG TCGGAGTGCC CCATCGGGCT GATCGGCGAC
GACATCGACT CCGTGGGTCG GCAGGCCACC GAGCGCCTCG GTAAGCCGGT GATCCCGGTG
CGCTGCGAGG GGTTCCGCGG CGTGTCGCAG TCCCTCGGCC ACCACATCGC CAACGACACC
ATCCGCGACC ACGTCCTGGA GAACCGCGAG GGCGACGGCC GCCAGGCCGG GCCCTACGAC
GTGGCCATCG TCGGCGACTA CAACATCGGT GGCGACTGCT GGGCCTCGCG CATCCTGCTC
GAGGAGATGG GCCTGAACGT GGTGGCGCAG TGGTCCGGCG ACGGCACCCT GGCCGAGATG
GAGAACACGC CCAAGGTCCA GCTCAATCTC CTGCACTGCT ACCGCTCCAT GAACTACATC
TGCGAGCACA TGGAGAAGAC CCACGGCATC CCGTGGATGG AGTTCAACTT CTTCGGGCCG
ACGCGGATCG CCCAGAGCCT ACGCGAGATC GCGGCGCAGT TCGACGAGAC CATCCAGGCC
AACGCCGAGC AGGTCATCGC CAAGTACCAG CCGACCATGG AGGCGGTGAT CGCCAAGTAT
CGGCCGCGGC TCGAGGGCAA GAAGGTGATG CTCTACGTCG GCGGGCTGCG CTCGCGCCAC
GTCATCGGGG CCTACGAGGA CCTGGGCATG GAGGTCATCG GCACCGGCTA CGAGTTCGCC
CACGACGACG ACTACGACCG CACCTACCCG GAGCTCAAGG AGGGGACGCT GGTCTACGAC
GACGCCAACG CCTTCGAGCT GGAGCGTTTC ATCGAGCGGG CCCGACCGGA CCTGGTGGCC
GCCGGGATCA AGGAGAAGTA CGTCTTCCAG AAGATGGGTC TGCCGTTCCG GCAGATGCAC
TCCTGGGACT ACTCCGGGCC GTACCACGGC TACGACGGCT TCGCGATCTT CGCCCGGGAT
ATGGACATGA CCCTGAACAA CCCGGTCTGG GATCGCATGA CTCCGCCGTG GAAAGCCACC
GGTGAGCCGG CCTCCAAGGC GGCCTGA
 
Protein sequence
MSNLTQEEAQ AVIEEVLEVY PEATRAERDK HLAAVGPDQQ PSKCVTSNRK SVPGVMTQRG 
CAYAGSKGVV WGPIKDMVHV SHGPVGCGQL SRAGRRNYYT GHTGVDTFGT INFTSDFQER
DIVFGGDPKL EKIVDEIEML FPLNKGITVQ SECPIGLIGD DIDSVGRQAT ERLGKPVIPV
RCEGFRGVSQ SLGHHIANDT IRDHVLENRE GDGRQAGPYD VAIVGDYNIG GDCWASRILL
EEMGLNVVAQ WSGDGTLAEM ENTPKVQLNL LHCYRSMNYI CEHMEKTHGI PWMEFNFFGP
TRIAQSLREI AAQFDETIQA NAEQVIAKYQ PTMEAVIAKY RPRLEGKKVM LYVGGLRSRH
VIGAYEDLGM EVIGTGYEFA HDDDYDRTYP ELKEGTLVYD DANAFELERF IERARPDLVA
AGIKEKYVFQ KMGLPFRQMH SWDYSGPYHG YDGFAIFARD MDMTLNNPVW DRMTPPWKAT
GEPASKAA