Gene Gura_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1201 
Symbol 
ID5163246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1414832 
End bp1417585 
Gene Length2754 bp 
Protein Length917 aa 
Translation table11 
GC content62% 
IMG OID640548705 
Productbifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN 
Protein accessionYP_001229978 
Protein GI148263272 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGC CCGATTACTA CGACGTAACC GAATGCGAAA CCCACGACGC GGGCGCCCCC 
AAGTTCTGCA AAAAATCCGA ACCGGGGGAG GGGACCGAAC GGAGCTGCGC CTACGACGGG
GCCCGGGTGG TGCTGATGCC GATCACCGAC GTCATTCACC TGGTCCACGG CCCCATTGCC
TGTGCCGGCA ACTCCTGGGA CAACCGGGGC GCGCGCTCTT CAGACTCCCA GCTCTACCGC
CGCGGCTTTA CCACCGAGAT GCTGGAAAAC GACGTCATCT TCGGCGGCGA AAAGAAGCTC
TACAAGGCCA TTCTGGAACT GGCCGAACGC TATAAGCCCA AGGCGATCTT CGTCTATGCC
ACCTGCGTCA CCGCCATGAC CGGCGATGAC GTGGAGGCGG TCTGCACAGC GGCGCAGGAA
AAGGTGGCGA TGCCGATAAT TCCCGTGAAT ACGCCGGGGT TTATAGGCGA TAAGAATATC
GGCAACCGGC TGGCCGGCGA GGTGCTGTTC AAGTACGTGA TCGGCACCGC CGAGCCGGAA
TATACTACCG ATTACGACAT CAACCTGATC GGCGAGTACA ACATCGCCGG CGACCTGTGG
GGGATGCTGC CGCTCTTCGA CAAGCTGGGC ATCCGCGTCC TCTCCTGCTT CAGCGGCGAT
GCCAAGTTCG AAGATCTGCG CTACGCCCAC CGGGCAAAGC TGAACGTCAT CATCTGTTCC
AAATCGCTCA CCAACCTGGC GAAGAAGATG CAGAAGACTT ACGGCATGCC TTACCTGGAG
GAATCCTTTT ACGGCATGAC CGATGTTGCC AAGGCCCTGC GCGACATAGC GAGAGAGCTG
GACAATGTCT CCGGCGGTCT GGAAAAACGG GTCATGCAGG AGCGGGTGGA AAGGCTGATC
GAGGAAGAGG AAGCCAGGTG CCGGGAACTG ATAGCACCGT ACCGGGCACG GCTGGAAGGG
AAGCGGGCGG TGCTCTTTAC CGGTGGGGTG AAGACCTGGT CCATGGTCAA CGCCCTGGCC
GAGCTCGGGG TGGAGATCCT CGCTGCCGGC ACCCAGAACT CGACCCTGGA AGACTTCTAC
CGGATGAAGG CCCTGATGCA CAAGGATGCC CGGATCATCG ACGACACCAG CACCGCCGGT
CTCCTTTCGG TCATGTACGA AAAGATGCCG GACCTGATCG TCGCCGGGGG GAAGACCAAG
TTCCTGGCGC TCAAGACCAA GACCCCCTTT CTCGACATCA ACCACGGCCG CTCCCATCCC
TATGCCGGCT ATGATGGCAT GGTCACCTTT GCCAAGCAGC TCGACCTGAC GGTGAACAAC
CCGATCTGGC CCGTGCTGAA CGCCAAGGCG CCGTGGGAGA AGAGCGATGC GGAGCTCAAT
GCCGATGTGG CCCTGGCTGC CGGGCACAGC ACTGCCCATC TCAATGAGGA CATGAAGGAG
TCGCGGGTCA AGGTGCCGAC CAAGAATGCC ACCGTCAACC CGCAGAAGAA CTCCCCGGCC
CTGGGTGCGA CCCTGGCCTA CCTGGGGATC GACCAGATGC TCGGCCTCCT CCATGGCGCC
CAGGGGTGCT CGACCTTCAT CCGCCTCCAG TTGTCGCGGC ATTTCAAGGA GTCGATCGCC
CTCAACTCCA CCAGCATGAG CGAAGACACC GCCATCTTCG GCGGTTGGGA AAACCTGAAG
AAGGGGATCA AGCGGGTGAT CGAGAAGTTC GGCCCGCAGG TGGTGGGGGT GATGACCTCC
GGTCTTACCG AAACCATGGG GGACGACGTG CGGAGCGCCA TTGTCCACTT CCGCCAGGAG
AACCCGGAGT TTGCCCATGT GCCGGTCATC CACGCCTCGA CCCCGGACTA CTGCGGTTCC
ATGCAGGAAG GGTACGCTGC AGCGGTGGAG GCGATCGTTG CCACCATCCC TGAAGGAGGG
GAGAAGATCA AGGGGCAGGT GACCATCCTT CCCGGCTGTC ATCTCACACC GGCCGATGTG
GAAGAGGTGG CGGAGATCTG CGAGGCGTTC GGCCTGACGC CGCTGGTGAT TCCCGATATC
TCCAACGCCC TCGACGGCCA CATCGACGAG ACCGTGTCGC CACTCTCCGT CGGCGGCGTG
ACCCTGGATA AGGTCCGTCT GGCCGGTCGG AGCGAAGCCA CCCTTTACCT GGGGGATTCC
CTGGCCAAGG CCGCCGAAAT ACTGAAAGAG AACTTCGCCA TCCCCTGTTA CGGCTTCACC
TCCATCACCG GCCTGGCCGA GACGGACAGC CTGATGGAGA CCCTTTCCGC CATCGCCGGC
CGTCCCGTTC CGGAAAAGTT ACGCCGCTGG CGGAGCCGCC TCATGGATGC CATGGTCGAT
TGCCACTACC AGTTCGGCCT GAAGCGGATC TCCCTGGCCC TGGAAGCGGA CCTCCTGAAG
ACCATGACCC TCTTCCTCGC CGGGATGGGG TGCCGGATCC AGGCGGCCAT ATCCGCCACC
CGGGTGCGGG GACTTGACCG GCTCCCCACG GACAACATCT TTGTCGGCGA CCTGGAAGAC
CTGGAAAACA GCGCCCAGGG GAGCGATCTC CTGGTGGCCA ATTCCAACGG CCGTCAGGCA
GCCGCCAGAC TGGGGGGGAT ACCGCTTCTG CGGGCCGGGC TGCCGGTGTT CGACCGGTTG
GGGGCCCATC AGAAGATGTA CGTCGGCTAT CGGGGAACCA TGAATCTCGT CTTCGAGACG
GCGACGATTT TCCAGGCCAA TGCAAAGGAA GCGCAGAAAC TGGCGCATAA TTGA
 
Protein sequence
MAKPDYYDVT ECETHDAGAP KFCKKSEPGE GTERSCAYDG ARVVLMPITD VIHLVHGPIA 
CAGNSWDNRG ARSSDSQLYR RGFTTEMLEN DVIFGGEKKL YKAILELAER YKPKAIFVYA
TCVTAMTGDD VEAVCTAAQE KVAMPIIPVN TPGFIGDKNI GNRLAGEVLF KYVIGTAEPE
YTTDYDINLI GEYNIAGDLW GMLPLFDKLG IRVLSCFSGD AKFEDLRYAH RAKLNVIICS
KSLTNLAKKM QKTYGMPYLE ESFYGMTDVA KALRDIAREL DNVSGGLEKR VMQERVERLI
EEEEARCREL IAPYRARLEG KRAVLFTGGV KTWSMVNALA ELGVEILAAG TQNSTLEDFY
RMKALMHKDA RIIDDTSTAG LLSVMYEKMP DLIVAGGKTK FLALKTKTPF LDINHGRSHP
YAGYDGMVTF AKQLDLTVNN PIWPVLNAKA PWEKSDAELN ADVALAAGHS TAHLNEDMKE
SRVKVPTKNA TVNPQKNSPA LGATLAYLGI DQMLGLLHGA QGCSTFIRLQ LSRHFKESIA
LNSTSMSEDT AIFGGWENLK KGIKRVIEKF GPQVVGVMTS GLTETMGDDV RSAIVHFRQE
NPEFAHVPVI HASTPDYCGS MQEGYAAAVE AIVATIPEGG EKIKGQVTIL PGCHLTPADV
EEVAEICEAF GLTPLVIPDI SNALDGHIDE TVSPLSVGGV TLDKVRLAGR SEATLYLGDS
LAKAAEILKE NFAIPCYGFT SITGLAETDS LMETLSAIAG RPVPEKLRRW RSRLMDAMVD
CHYQFGLKRI SLALEADLLK TMTLFLAGMG CRIQAAISAT RVRGLDRLPT DNIFVGDLED
LENSAQGSDL LVANSNGRQA AARLGGIPLL RAGLPVFDRL GAHQKMYVGY RGTMNLVFET
ATIFQANAKE AQKLAHN