Gene Lferr_1239 details

Gene Information

Locus tag: Lferr_1239
Symbol:
ID: 6877212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1205411
End bp: 1206886
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 53%
IMG OID: 642789116
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_002219684
Protein GI: 198283363
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
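
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1205411, 1206886)
print(length, protein_length(length))  # prints: 1476 491
```

Under those assumptions, 1476 bp corresponds to 492 codons: 491 amino acids plus one stop codon.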


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0291496
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATTA CCGCTGAAGA AACCCGGGAG CAGATTGTCG CCGAGACCAA GACGCGCAAT 
CGTGCCCTGA TTGATGAAGT TCTCAAGGTC TATCCGGAAA AAACCGCCAA GAGACGCGCC
AAACACCTGA ACGTCTTCGA GGACGGTAAA TCGGATTGCG GCGTCAAATC CAATATCAAG
TCGGTGCCCG GCGTGATGAC CATTCGTGGT TGTGCCTATG CGGGTTCAAA GGGCGTCGTC
TGGGGTCCGA TCAAGGACAT GATTCACATT TCCCATGGAC CCGTCGGTTG CGGCCAGTAT
TCCTGGGGCT CGCGACGCAA CTATTATATT GGCACAACGG GTGTGGATAC CTTTGTCACG
ATGCAGTTCA CTTCCGATTT TCAGGAAAAA GATATCGTTT TCGGCGGTGA CAAGAAGCTC
GAAAAAATCA TGGATGAAAT CGAGGACCTC TTCCCGCTGA ACCACGGTAT TACCGTGCAA
TCGGAATGCC CCATCGGCCT GATCGGAGAC GATATCGAAG CGGTGTCCAA AAAGAAGTCC
AAGGAATTTG GTGGAAAAAC CATTGTTCCG GTGCGTTGTG AGGGCTTCCG GGGGGTATCC
CAATCGCTCG GCCACCATAT CGCCAACGAC AGCGTCCGCG ACTATGTCTT CGAGAAACAA
GGCGCGGAAC CCAAGGCCTT TGAAAAGACC CCTTACGATG TCGCCATTAT TGGTGACTAT
AACATCGGCG GCGATGCCTG GAGTTCACGC ATCCTGCTGG AAGAAATGGG TTTGCGGGTG
ATTGCCCAAT GGTCCGGCGA TGGTTCTCTG GCGGAATTGG AGAACACCCC CATGGCCAAA
TTGAATGTCC TGCACTGCTA TCGTTCCATG AACTATATCT CCCGCTACAT GGAAGAAAAG
CATGGCGTGC CGTGGGTCGA GTATAACTTC TTCGGACCTT CCAAAATTGC CGAGTCTTTG
CGGACGATTG CCAGTTATTT TGATGATCAC ATCAAGGAAG GCGCGGAACG GGTTATCGAG
AAGTACAACC GCCTCACCGC AGAAGTGATC GCCAAATACC GGCCGCGCCT GGAAGGCAGA
ACGGTCATGC TTTTTGTCGG GGGATTGCGT CCTCGCCATG TCATCGGTGC CTATGAAGAT
CTCGGCATGA ATGTAGTGGG AACCGGCTAT GAATTCGGCC ACAACGACGA CTATCAGCGT
ACTACCCACG ATGTGAAAGA CGGTACGCTG ATTTATGACG ATGTGACCGG CTACGAGTTT
GAAAAGATGG TGGAAACCAT CCAGCCCGAT CTCGTGGGTT CCGGCATCAA GGAAAAGTAT
GTCTTCCAGA AGATGGGCGT GCCCTTCCGC CAGATGCACT CCTGGGACTA TTCCGGTCCT
TATCACGGCT ATGACGGCTT CGCCATCTTC GCCCGGGATA TGGATATGGC CATCAACAAC
CCGGTGTGGA GTTTGACCAA GACGCCCTGG GAGTAA
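
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in space-separated blocks of ten bases. The sketch below assumes the figure is the plain fraction of G and C bases over the whole sequence; how the database rounds the percentage is not stated, so no rounding is applied. `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used to block the sequence."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence above.
print(gc_content("ATGAGCATTA CCGCTGAAGA"))  # prints: 0.45
```

Passing the full gene sequence, with or without line breaks, gives the value for the whole gene.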
 
Protein sequence
MSITAEETRE QIVAETKTRN RALIDEVLKV YPEKTAKRRA KHLNVFEDGK SDCGVKSNIK 
SVPGVMTIRG CAYAGSKGVV WGPIKDMIHI SHGPVGCGQY SWGSRRNYYI GTTGVDTFVT
MQFTSDFQEK DIVFGGDKKL EKIMDEIEDL FPLNHGITVQ SECPIGLIGD DIEAVSKKKS
KEFGGKTIVP VRCEGFRGVS QSLGHHIAND SVRDYVFEKQ GAEPKAFEKT PYDVAIIGDY
NIGGDAWSSR ILLEEMGLRV IAQWSGDGSL AELENTPMAK LNVLHCYRSM NYISRYMEEK
HGVPWVEYNF FGPSKIAESL RTIASYFDDH IKEGAERVIE KYNRLTAEVI AKYRPRLEGR
TVMLFVGGLR PRHVIGAYED LGMNVVGTGY EFGHNDDYQR TTHDVKDGTL IYDDVTGYEF
EKMVETIQPD LVGSGIKEKY VFQKMGVPFR QMHSWDYSGP YHGYDGFAIF ARDMDMAINN
PVWSLTKTPW E
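
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own procedure. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). `translate` is a hypothetical helper; it stops at the first stop codon and ignores any trailing partial codon.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks between blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence.
print(translate("ATGAGCATTA CCGCTGAAGA AACCCGGGAG"))  # prints: MSITAEETRE
```

The output matches the first ten residues of the protein sequence listed above.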