Gene Lferr_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1235 
Symbol 
ID6877208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1201588 
End bp1202979 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content55% 
IMG OID642789112 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifE 
Protein accessionYP_002219680 
Protein GI198283359 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00336463 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGCAGA ACAAAATCCA GGATGTTTTT AATGAACCGG GATGCAGCAA GAACCAGAGC 
AAGTCCGACA AAGAGCGCAA GAAAGGCTGC ACCAAGGCGC TGCAGCCGGG CGGCGCGGCC
GGTGGTTGCG CTTTCGATGG GGCAAAGATT GCCTTGCAAC CCATTACCGA CGTCGCCCAC
CTGGTCCACG GTCCCATCGC CTGTGAGGGC AACTCCTGGG ATAACCGCGG CTCCAAATCG
TCTGGTTCGC AACTGTATCG CACCGGCTTT ACCACCGACA TCAATGAACT GGATGTGGTC
TACGGCGGCG AAAAACACCT CTTCAAATCC ATCAAGGAAG TACTTGATAA ATACGATCCG
TCGGCGGTAT TTGTCTATCA GACCTGCGTG ACGGCAATGA TCGGAGACGA TATCGAATCC
GTCTGTAAAG CGGCCAGTCA AAAATTCGCA AAGCCCATTA TTCCGGTCAA TGCCCCCGGT
TTTGTCGGCG CCAAGAATCT CGGCAACAAA CTGGCGGGAG AGGCCCTGCT GGATTATGTG
ATCGGCACGG AGGAGCCAGA ATATAGTACG CCCTATGACA TCAATATCAT TGGCGAATAC
AATCTTTCCG GTGAACTCTG GCAGGTCAAA CCACTTCTCG ATCATTTGGG GATCCGGGTA
ACCTGTTGCA TCAGCGGTGA CGCCAAATAT CACGACGTGG CGCAATCCCA TCGTGCCAGG
GCCAATATGA TGGTCTGCTC TAAATCCATG ATCAATATCG CCCGCAAAAT GGAAGAGCGT
TATCAGATTC CGTTCTTTGA AGGGTCCTTT TATGGCATCT CCGATACCAC GGAGTCGCTC
CGGGAGATCA CCCGCCTGCT GATCCAGCAG GGTGCCCCGG CAGAGCTCCA CGACCGCACC
GAAGCGCTGA TCGCCCGGGA AGAGGCAAGG GCCTGGCAAC GTATCGCCGA ATACACCCAT
CGCCTGCGCG GCAAACGGGT GTTGCTCTTT ACGGGGGGCG TCAAATCCTG GTCGGTGGTA
TCTGCATTGC AGGAAGGCGG GATGGAAGTG GTGGGAACCA GCGTCAAGAA ATCCACCAGG
GAGGACAAGG AAAGAATCAA GGAAATCATG GGTCAGGATG CGCATATGCT GGATGACCTG
ACCCCTCGGG AAATGTACAA AATGTTTCAG GAGGCGCGTG CGGATGTGTT GCTGTCGGGC
GGACGTTCAC AATTTGCGGC CCTCAAAAAC AAAATGCCCT GGGTGGACAT CAACCAGGAA
CGCCATCAGG CCTATAACGG TTATGAAGGG ATGGTCAACC TGGTGAAACA GATCGATTTG
GCCCTCTACA ATCCCATGTG GGCCTTGTTG CGCAAACCCG CGCCCTGGGA TATGGGGGAG
GCACGGACAT GA
 
Protein sequence
MLQNKIQDVF NEPGCSKNQS KSDKERKKGC TKALQPGGAA GGCAFDGAKI ALQPITDVAH 
LVHGPIACEG NSWDNRGSKS SGSQLYRTGF TTDINELDVV YGGEKHLFKS IKEVLDKYDP
SAVFVYQTCV TAMIGDDIES VCKAASQKFA KPIIPVNAPG FVGAKNLGNK LAGEALLDYV
IGTEEPEYST PYDINIIGEY NLSGELWQVK PLLDHLGIRV TCCISGDAKY HDVAQSHRAR
ANMMVCSKSM INIARKMEER YQIPFFEGSF YGISDTTESL REITRLLIQQ GAPAELHDRT
EALIAREEAR AWQRIAEYTH RLRGKRVLLF TGGVKSWSVV SALQEGGMEV VGTSVKKSTR
EDKERIKEIM GQDAHMLDDL TPREMYKMFQ EARADVLLSG GRSQFAALKN KMPWVDINQE
RHQAYNGYEG MVNLVKQIDL ALYNPMWALL RKPAPWDMGE ART