Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4484 |
Symbol | |
ID | 3907460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5353308 |
End bp | 5355035 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637881816 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifN |
Protein accession | YP_483559 |
Protein GI | 86743159 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.12413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGA TCGTGACCGG AGAGCGGCAG GCGACGATCG ACCCGCTCAA GCACAGCCAG CCGCTCGGGG GTGCGCTGGT CTTCCTGGGC CTCGCCGGCT CGCTGCCGAT CATGCATGGT GCGCAGGGGT GCGCGTCATT CGCCAAGGCC CTGCTCACCC GGCATTTCAA CGAGCCGATC CCGCTGCAGA CCACCGCGAT CACCGAGGTG ACCGCGGTCC TCGGCAGTGG TGAGGCGATG GTGGAGACGC TCGATGCCAT CCGGAAGAAG CAGCGGCCGG AGATCATCGG CCTGCTCACC ACGGGGGTCA CCGAGGTCAG CGGCGAGGAC GTCGGCGGCC AGCTGCGCAA GTACCTCTCG GCCTGGAACG AGACCGACTC GAGCGGGCCG GAGGCGGGCG CGAACGGCTC CGCCCCCGGC AGGGCCCCGC TGATCGTCGG GGTCTCCACC CCGGACTTCA TCGGGGGGCT GTCCGACGGC TGGTCGGCCG CGTTGGAGGC GCTGGTCCGA GCGGTGGTGC CGGCAGAGCA TTCCGAAACG GAGCCTTCCG AAGCGAGATA TTTTACCGGG CGCACCGCCT TCATCGCGGC CAGCGCTCCC GTAACGCATC TCACTACCCC CGGGTCCGCC GGGTCCGCCG AGGCTGCCAG GTCCACCAGG TCCACCAGGT CCACCAGGTC CGCCAGGTCC CTGGGGTCCG CCGAGGCTGC CAGGTCCACC AGGTCCGCCA GGTCCCTGGG GTCCGCCGCC CGACGGGTGG CGGCGGCGGA CGGTGGTGCT CTGGACCCGC GCCAGCTCGC CGTCCTGGTC GGCCCGTCCC TGACGGCGGC CGACCTCGAC GAGCTCGGTG AGTTGATTCG CGCGTTCGGG CTCGACCCGG TACTCGTCCC CGACCTGTCC GGATCGGTGG ACGGGCACCT GGCCCCCGCC TGGCAGCCCA CGACGACCGG CGGCACCGGG CTCGCGCGGC TGCGGGCGCT CGGTCGGTCG CGGGCCGTGC TCGTCGCGGG TGCGACCGCG GCGGCGGCCG GTGACCTGCT CGCAGCCCGG ACCGGCGCCC GGATCCTCCG CCACCGTCAT CTCAGCGGGC TGACGGAGAT GGACACCTTG GTCACCGAGC TGATCGAACT CACCGGAGCG TCCGCGCCGG CCCGGGTCCG GCAGGCCCGG GCCCGGCTCG CCGACGGTCT GCTCGACACG CACTTCGTCC TCGGTGGCGC CCGGGTCGCG CTGGCAATGG AGCCGGAGAC CCTGGTCGCC GTCGGTTCCC TGCTGCACGA CGTCGGGGCC GAGGTCGTGG CGGCGGTGTC CCCGACCGCC GCGCCGGTCC TCGCGGACGC GCCCTGGGAC GAGGTCGTCG TCGGTGATCT GACCGACCTC GCCGAACGGG CTCGGGCGGG TGGTGCGCAC CTCGTCCTCG GTTCCAGCCA CGCCCGCGAG GTCGCCGACC GGATTGGCGC CGCGCACCTG CCCGTCGGCT TCCCGATCTT CGATCGGCTC GGCGCGGCGC TGGCCGGTAC CGCCGGGTAC GCCGGAAGCC TGCGCCTGCT CATCGACGCC GCGAATCGTC TGCTCGATCA CGAGCACGTG CACCGCCGTT CCGGGCGGAG GTTCTCCGTG TCCGGCGCGG ATGGACAGCC CGCCCACCGG ACCGAGATCC CCACCCAACC CGGCTCCGCG GGTGAATCCG ATCAACTCGA CAACCTGTTC CAGGAGTCTC CGTGTTGA
|
Protein sequence | MAEIVTGERQ ATIDPLKHSQ PLGGALVFLG LAGSLPIMHG AQGCASFAKA LLTRHFNEPI PLQTTAITEV TAVLGSGEAM VETLDAIRKK QRPEIIGLLT TGVTEVSGED VGGQLRKYLS AWNETDSSGP EAGANGSAPG RAPLIVGVST PDFIGGLSDG WSAALEALVR AVVPAEHSET EPSEARYFTG RTAFIAASAP VTHLTTPGSA GSAEAARSTR STRSTRSARS LGSAEAARST RSARSLGSAA RRVAAADGGA LDPRQLAVLV GPSLTAADLD ELGELIRAFG LDPVLVPDLS GSVDGHLAPA WQPTTTGGTG LARLRALGRS RAVLVAGATA AAAGDLLAAR TGARILRHRH LSGLTEMDTL VTELIELTGA SAPARVRQAR ARLADGLLDT HFVLGGARVA LAMEPETLVA VGSLLHDVGA EVVAAVSPTA APVLADAPWD EVVVGDLTDL AERARAGGAH LVLGSSHARE VADRIGAAHL PVGFPIFDRL GAALAGTAGY AGSLRLLIDA ANRLLDHEHV HRRSGRRFSV SGADGQPAHR TEIPTQPGSA GESDQLDNLF QESPC
|
| |