Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_1201 |
Symbol | |
ID | 5163246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 1414832 |
End bp | 1417585 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640548705 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | YP_001229978 |
Protein GI | 148263272 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAGC CCGATTACTA CGACGTAACC GAATGCGAAA CCCACGACGC GGGCGCCCCC AAGTTCTGCA AAAAATCCGA ACCGGGGGAG GGGACCGAAC GGAGCTGCGC CTACGACGGG GCCCGGGTGG TGCTGATGCC GATCACCGAC GTCATTCACC TGGTCCACGG CCCCATTGCC TGTGCCGGCA ACTCCTGGGA CAACCGGGGC GCGCGCTCTT CAGACTCCCA GCTCTACCGC CGCGGCTTTA CCACCGAGAT GCTGGAAAAC GACGTCATCT TCGGCGGCGA AAAGAAGCTC TACAAGGCCA TTCTGGAACT GGCCGAACGC TATAAGCCCA AGGCGATCTT CGTCTATGCC ACCTGCGTCA CCGCCATGAC CGGCGATGAC GTGGAGGCGG TCTGCACAGC GGCGCAGGAA AAGGTGGCGA TGCCGATAAT TCCCGTGAAT ACGCCGGGGT TTATAGGCGA TAAGAATATC GGCAACCGGC TGGCCGGCGA GGTGCTGTTC AAGTACGTGA TCGGCACCGC CGAGCCGGAA TATACTACCG ATTACGACAT CAACCTGATC GGCGAGTACA ACATCGCCGG CGACCTGTGG GGGATGCTGC CGCTCTTCGA CAAGCTGGGC ATCCGCGTCC TCTCCTGCTT CAGCGGCGAT GCCAAGTTCG AAGATCTGCG CTACGCCCAC CGGGCAAAGC TGAACGTCAT CATCTGTTCC AAATCGCTCA CCAACCTGGC GAAGAAGATG CAGAAGACTT ACGGCATGCC TTACCTGGAG GAATCCTTTT ACGGCATGAC CGATGTTGCC AAGGCCCTGC GCGACATAGC GAGAGAGCTG GACAATGTCT CCGGCGGTCT GGAAAAACGG GTCATGCAGG AGCGGGTGGA AAGGCTGATC GAGGAAGAGG AAGCCAGGTG CCGGGAACTG ATAGCACCGT ACCGGGCACG GCTGGAAGGG AAGCGGGCGG TGCTCTTTAC CGGTGGGGTG AAGACCTGGT CCATGGTCAA CGCCCTGGCC GAGCTCGGGG TGGAGATCCT CGCTGCCGGC ACCCAGAACT CGACCCTGGA AGACTTCTAC CGGATGAAGG CCCTGATGCA CAAGGATGCC CGGATCATCG ACGACACCAG CACCGCCGGT CTCCTTTCGG TCATGTACGA AAAGATGCCG GACCTGATCG TCGCCGGGGG GAAGACCAAG TTCCTGGCGC TCAAGACCAA GACCCCCTTT CTCGACATCA ACCACGGCCG CTCCCATCCC TATGCCGGCT ATGATGGCAT GGTCACCTTT GCCAAGCAGC TCGACCTGAC GGTGAACAAC CCGATCTGGC CCGTGCTGAA CGCCAAGGCG CCGTGGGAGA AGAGCGATGC GGAGCTCAAT GCCGATGTGG CCCTGGCTGC CGGGCACAGC ACTGCCCATC TCAATGAGGA CATGAAGGAG TCGCGGGTCA AGGTGCCGAC CAAGAATGCC ACCGTCAACC CGCAGAAGAA CTCCCCGGCC CTGGGTGCGA CCCTGGCCTA CCTGGGGATC GACCAGATGC TCGGCCTCCT CCATGGCGCC CAGGGGTGCT CGACCTTCAT CCGCCTCCAG TTGTCGCGGC ATTTCAAGGA GTCGATCGCC CTCAACTCCA CCAGCATGAG CGAAGACACC GCCATCTTCG GCGGTTGGGA AAACCTGAAG AAGGGGATCA AGCGGGTGAT CGAGAAGTTC GGCCCGCAGG TGGTGGGGGT GATGACCTCC GGTCTTACCG AAACCATGGG GGACGACGTG CGGAGCGCCA TTGTCCACTT CCGCCAGGAG AACCCGGAGT TTGCCCATGT GCCGGTCATC CACGCCTCGA CCCCGGACTA CTGCGGTTCC ATGCAGGAAG GGTACGCTGC AGCGGTGGAG GCGATCGTTG CCACCATCCC TGAAGGAGGG GAGAAGATCA AGGGGCAGGT GACCATCCTT CCCGGCTGTC ATCTCACACC GGCCGATGTG GAAGAGGTGG CGGAGATCTG CGAGGCGTTC GGCCTGACGC CGCTGGTGAT TCCCGATATC TCCAACGCCC TCGACGGCCA CATCGACGAG ACCGTGTCGC CACTCTCCGT CGGCGGCGTG ACCCTGGATA AGGTCCGTCT GGCCGGTCGG AGCGAAGCCA CCCTTTACCT GGGGGATTCC CTGGCCAAGG CCGCCGAAAT ACTGAAAGAG AACTTCGCCA TCCCCTGTTA CGGCTTCACC TCCATCACCG GCCTGGCCGA GACGGACAGC CTGATGGAGA CCCTTTCCGC CATCGCCGGC CGTCCCGTTC CGGAAAAGTT ACGCCGCTGG CGGAGCCGCC TCATGGATGC CATGGTCGAT TGCCACTACC AGTTCGGCCT GAAGCGGATC TCCCTGGCCC TGGAAGCGGA CCTCCTGAAG ACCATGACCC TCTTCCTCGC CGGGATGGGG TGCCGGATCC AGGCGGCCAT ATCCGCCACC CGGGTGCGGG GACTTGACCG GCTCCCCACG GACAACATCT TTGTCGGCGA CCTGGAAGAC CTGGAAAACA GCGCCCAGGG GAGCGATCTC CTGGTGGCCA ATTCCAACGG CCGTCAGGCA GCCGCCAGAC TGGGGGGGAT ACCGCTTCTG CGGGCCGGGC TGCCGGTGTT CGACCGGTTG GGGGCCCATC AGAAGATGTA CGTCGGCTAT CGGGGAACCA TGAATCTCGT CTTCGAGACG GCGACGATTT TCCAGGCCAA TGCAAAGGAA GCGCAGAAAC TGGCGCATAA TTGA
|
Protein sequence | MAKPDYYDVT ECETHDAGAP KFCKKSEPGE GTERSCAYDG ARVVLMPITD VIHLVHGPIA CAGNSWDNRG ARSSDSQLYR RGFTTEMLEN DVIFGGEKKL YKAILELAER YKPKAIFVYA TCVTAMTGDD VEAVCTAAQE KVAMPIIPVN TPGFIGDKNI GNRLAGEVLF KYVIGTAEPE YTTDYDINLI GEYNIAGDLW GMLPLFDKLG IRVLSCFSGD AKFEDLRYAH RAKLNVIICS KSLTNLAKKM QKTYGMPYLE ESFYGMTDVA KALRDIAREL DNVSGGLEKR VMQERVERLI EEEEARCREL IAPYRARLEG KRAVLFTGGV KTWSMVNALA ELGVEILAAG TQNSTLEDFY RMKALMHKDA RIIDDTSTAG LLSVMYEKMP DLIVAGGKTK FLALKTKTPF LDINHGRSHP YAGYDGMVTF AKQLDLTVNN PIWPVLNAKA PWEKSDAELN ADVALAAGHS TAHLNEDMKE SRVKVPTKNA TVNPQKNSPA LGATLAYLGI DQMLGLLHGA QGCSTFIRLQ LSRHFKESIA LNSTSMSEDT AIFGGWENLK KGIKRVIEKF GPQVVGVMTS GLTETMGDDV RSAIVHFRQE NPEFAHVPVI HASTPDYCGS MQEGYAAAVE AIVATIPEGG EKIKGQVTIL PGCHLTPADV EEVAEICEAF GLTPLVIPDI SNALDGHIDE TVSPLSVGGV TLDKVRLAGR SEATLYLGDS LAKAAEILKE NFAIPCYGFT SITGLAETDS LMETLSAIAG RPVPEKLRRW RSRLMDAMVD CHYQFGLKRI SLALEADLLK TMTLFLAGMG CRIQAAISAT RVRGLDRLPT DNIFVGDLED LENSAQGSDL LVANSNGRQA AARLGGIPLL RAGLPVFDRL GAHQKMYVGY RGTMNLVFET ATIFQANAKE AQKLAHN
|
| |