Gene Hhal_0272 details

Gene Information

Locus tag: Hhal_0272
Symbol:
ID: 4711168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 308526
End bp: 310097
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 64%
IMG OID: 639854732
Product: nitrogenase molybdenum-iron protein beta chain
Protein accession: YP_001001868
Protein GI: 121997081
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01286] nitrogenase molybdenum-iron protein beta chain
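
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive at both ends, and that the final codon is a stop codon that adds no amino acid; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 308526
end_bp = 310097

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is the stop codon
# and does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp)     # 1572
print(remainder)          # 0
print(protein_length_aa)  # 523
```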


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.322626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAG ACACCGAAGC CATCAAGCCG TGCTACCCGC TGTTCCGGGA TGACGACTAC 
CAGCAGGTTC TCAACCGCAA GCGGCAGGAG CACGAGGAGG GCGTCGATCA GCAGACCATC
GACGAGACCT TCCAGTGGAC GACCACGCCG GAGTACCAGG AGCTGAACTT CTCCCGCGAG
GCGCTGACCG TTGACCCGGG TAAGGTTTGC CAGCCCCTGG GCGCGGTGCT CTGCTCGCTG
GGGTTCGAGA AGACCCTACC CTACGTCCAC GGCTCCCAGG GCTGTGTGGC GTACTTCCGG
ACCTACTTCA ACCGCCACTT CAAGGAGCCG GTGGCCTGCG TCTCCGACTC CATGACCGAG
GACGCGGCGG TCTTCGGCGG TCACAAGAAC ATGAACCAGG GGCTGCAGAA CGCCTACCAG
ATCTACCAGC CCGAGGTGAT CGCCGTCTCC ACCACCTGCA TGGCCGAGGT CATCGGCGAC
GACCTCAACG CCTTCATCGG CAACGCCAAG GACGAGGGGT ATCTGCCCCA GGAGGTCCCG
ACCCCGTTTG CTCACACGCC GAGCTTTGTC GGCAGCCACA TCACCGGCTG GGACGGCATG
TTCGAGGGCC TCATCCGCTA CTTCACCATC AATGAGATGG AAGACAAGCA GCCGGGCAGC
AACGGCAAGC TCAACCTGGT GCCCGGCTTC GAGACCTACC TGGGTAGCTA CCGCTGGGTC
AAGCGCGCCC TGGATGAGAT GGGCGTGGAG GCCTCGGTGC TCAGTGATCC CACCGAGGTG
CTCGACACCC CGGCCGACGG CGAGTACCGG ATGTACGCCG GCGGGACGCC CATCGACGAG
GTCCGCGATG CGCCCAACGC CCTGGACACG CTCCTGCTCC AGCCCTGGGC GCTGACCAAG
ACCAAGAAGT ACGTCAAGCA GACCTGGAAG CACGACTGCC CCGAGGTGCA GGTGCCCATC
GGCCTCGAGG CGACGGACAA CTTCCTGATG CAGGTCTCGC AGATCACCGG CAAGCCGATC
CCCCCGTCCC TGGAGAAGGA GCGCGGGCGG CTGGTGGACA TGATGACCGA CTCCCACCAG
TGGCTGCACG GCAAGCGCTT CGCCCTGTGG GGGGATCCGG ACTTCGTCAT GGGCATGACC
CGCTTCCTCA CCGAGCTCGG CGCCGAACCG CGCCACGTGC TCTGCCACAA CGCCAACAAG
CGCTGGGGCA AGGCGATGCG CGCCATGCTC GAGCAGACGC CCTACGGGGA GAGCGCCCAG
GTCTACGTCG GCAAGGATCT GTGGCACATG CGTTCGCTGT GCTTCACCGA CAAGCCCGAC
TTCATGATCG GCAACTCCTA CGGCAAGTTC ATCCAGCGCG ACACCCTCTA CAAGGGCAAG
GAGTTCGAGG TCCCGCTGAT CCGCCTCGGC TTCCCGATCT TCGACCGGCA CCACCTGCAC
CGCAACAGCA TCTGGGGCTA TGAGGGGGCG ATGTACATCC TCACCACCAT CGTCAACGAG
GTGCTGGACC GGCTCGACGA GGAGACCCGG CACATGGGTG TCACCGACTT CAACTACGAC
CTGGTCCGCT GA
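
The GC content reported in the Gene Information table is the fraction of G and C bases over the whole gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as listed here (10-base groups separated by spaces and line breaks); `gc_content` is a hypothetical helper, not part of any database tooling.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and newlines used to group bases in the listing.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First row of the gene sequence above. A single row will not match the
# whole-gene figure; paste all rows to reproduce the reported value.
first_row = "ATGAACGAAG ACACCGAAGC CATCAAGCCG TGCTACCCGC TGTTCCGGGA TGACGACTAC"
print(f"{gc_content(first_row):.1%}")
```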
 
Protein sequence
MNEDTEAIKP CYPLFRDDDY QQVLNRKRQE HEEGVDQQTI DETFQWTTTP EYQELNFSRE 
ALTVDPGKVC QPLGAVLCSL GFEKTLPYVH GSQGCVAYFR TYFNRHFKEP VACVSDSMTE
DAAVFGGHKN MNQGLQNAYQ IYQPEVIAVS TTCMAEVIGD DLNAFIGNAK DEGYLPQEVP
TPFAHTPSFV GSHITGWDGM FEGLIRYFTI NEMEDKQPGS NGKLNLVPGF ETYLGSYRWV
KRALDEMGVE ASVLSDPTEV LDTPADGEYR MYAGGTPIDE VRDAPNALDT LLLQPWALTK
TKKYVKQTWK HDCPEVQVPI GLEATDNFLM QVSQITGKPI PPSLEKERGR LVDMMTDSHQ
WLHGKRFALW GDPDFVMGMT RFLTELGAEP RHVLCHNANK RWGKAMRAML EQTPYGESAQ
VYVGKDLWHM RSLCFTDKPD FMIGNSYGKF IQRDTLYKGK EFEVPLIRLG FPIFDRHHLH
RNSIWGYEGA MYILTTIVNE VLDRLDEETR HMGVTDFNYD LVR
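
The protein sequence follows from the gene sequence by reading it codon by codon. The sketch below is a standard-library illustration, not the database's own pipeline: it builds the codon table from the usual compact 64-letter encoding and translates until the first stop codon. Translation table 11 assigns the same amino acid to each codon as the standard table and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codons are enumerated in TCAG order, last base varying fastest, which
# matches the conventional compact encoding of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop grouping spaces/newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the final row of the gene sequence above.
print(translate("ATGAACGAAG ACACCGAAGC CATCAAGCCG"))  # MNEDTEAIKP
print(translate("CTGGTCCGCT GA"))                     # LVR
```

The two outputs match the first ten and last three residues of the protein sequence listed above; passing the full gene sequence to `translate` should yield the full protein.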