Gene Bind_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_0474 
Symbol 
ID6200670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp534854 
End bp536314 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content56% 
IMG OID641704466 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001831616 
Protein GI182677470 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.387575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTAT CGGCACCCGA AACAATCGAA GAAATCAAGC AGAGGAACAA GGAACTCATC 
GCTGAGGTCC TGGAGGCCTA TCCCGAAAAG AGCAAGAAAA ACCGCGCCAA GCACCTCAAC
CAGTTCCAGG AAGGTGGCAA GGATTGCTCG GTCAAGTCCA ACATCAAGTC CGTCCCGGGC
GTGATGACGA TCCGTGGCTG CGCCTACGCC GGTTCGAAGG GCGTGGTGTG GGGTCCGATC
AAGGACATGA TCCACATCAG CCATGGTCCG GTCGGCTGCG GCCAGTACTC CTGGGCTGCG
CGTCGTAACT ATTACATTGG CACGACCGGT GTTGACACCT TCGTGACCAT GCAGTTCACC
TCGGACTTCC AGGAAAAGGA CATTGTCTTC GGCGGCGACA AGAAGCTCGC GAAGATCATG
GACGAAATCA TGGAATTGTT CCCCCTGAAC CATGGTGTCA CGGTTCAGTC GGAATGCCCG
ATCGGCCTCA TCGGTGACGA CATCGAAGCC GTTTCGAAGC AGAAGTCCAA GGAATATGGC
GGCAAGACCA TTGTTCCGGT CCGCTGCGAA GGCTTCCGTG GCGTTTCCCA GTCTCTCGGC
CACCACATTG CGAACGACGC CGTTCGTGAC TGGGTGTTCG ACAAGATGGA AGGCAAGCCC
GCCCGTATCG AACTCACCGA CTATGACGTT GCCATCATCG GCGACTACAA CATCGGTGGT
GACGCTTGGT CGTCCCGTAT CCTTCTCGAG GAAATGGGCC TCCGCGTGAT CGCTCAGTGG
TCGGGTGACG GTTCCATCGC CGAACTCGAG GCGACGCCGA AGGCGAAGCT CAACGTCCTT
CACTGCTACC GCTCGATGAA CTACATCTCC CGCCACATGG AAGAAAAGTA CGGTGTTCCG
TGGGTGGAAT ATAACTTCTT CGGCCCGTCC AAGATCGCTG AGTCGCTGCG CACGATCGCC
AGCCACTTCG ACGACAAGAT CAAGGAAAAT GCCGAGAAGG TCATCGCCAA GTATCGCGCT
CTGTCCGATG CGGTGATCGA GAAGTATCGT CCGCGTCTCC ATGGCCGTAA GGTCATGCTC
TTCGTCGGCG GTCTGCGTCC GCGTCACGTT ATCGGCGCTT ACGAAGATCT CGGCATGGAA
GTTGTCGGTA CCGGCTATGA GTTCGGCCAT AACGACGACT ATCAGCGCAC CACTCACTAT
GTGAAGGACG GCACGCTGAT CTATGACGAC GTGACCGGCT ACGAGTTCGA GAAGTTCGTC
GAAAAGATCC AGCCTGATCT GGTTGGTTCC GGCATTAAGG AAAAGTACGT CTTCCAGAAA
ATGGGCGTTC CTTTCCGTCA GATGCATTCT TGGGACTATT CGGGCCCGTA TCATGGCTAT
GATGGCTTCG CCATCTTCGC TCGCGATATG GACATGGCCA TCAATTCCCC GGTTTGGGGT
TTGACCAAGG CTCCGTTCTA A
 
Protein sequence
MSLSAPETIE EIKQRNKELI AEVLEAYPEK SKKNRAKHLN QFQEGGKDCS VKSNIKSVPG 
VMTIRGCAYA GSKGVVWGPI KDMIHISHGP VGCGQYSWAA RRNYYIGTTG VDTFVTMQFT
SDFQEKDIVF GGDKKLAKIM DEIMELFPLN HGVTVQSECP IGLIGDDIEA VSKQKSKEYG
GKTIVPVRCE GFRGVSQSLG HHIANDAVRD WVFDKMEGKP ARIELTDYDV AIIGDYNIGG
DAWSSRILLE EMGLRVIAQW SGDGSIAELE ATPKAKLNVL HCYRSMNYIS RHMEEKYGVP
WVEYNFFGPS KIAESLRTIA SHFDDKIKEN AEKVIAKYRA LSDAVIEKYR PRLHGRKVML
FVGGLRPRHV IGAYEDLGME VVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV
EKIQPDLVGS GIKEKYVFQK MGVPFRQMHS WDYSGPYHGY DGFAIFARDM DMAINSPVWG
LTKAPF