Gene Franean1_6877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6877 
Symbol 
ID5675190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8379773 
End bp8381242 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content74% 
IMG OID641245726 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifN 
Protein accessionYP_001511117 
Protein GI158318609 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.320804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGGG TCGTCACGAG TGACCGCCGG CCCGGCCTGG ACCCGCTGCG GTTCAGCCAG 
CCGCTCGGCG GGGCACTGGT CTTCCTCGGC CTCGCCGCGG CGATGCCCGT CATGCACGGG
TCGAAGGGCT GCGCCTCGTT CGCCAAGGCG CTGCTGACCC GGCACTTCAA CGAGCCCGTC
CCGCTGCAGA CCACCGGTGT CACCGAGGTG TCCGCGGTGC TCGGCAGTGG CGACGACCTC
GTCGCCAACC TGGACGGCAT CCGCGCCAAG CAGAACCCGC GGATCATCGG GCTGCTGACC
ACCGGCGTCA CCGAGGTCAG CGGCGAGGAC GTCGCCGGCC AGGTCCGCCA GTACATCGCG
ATGATGAACC ACACCACCCC CGAGGGCGCG CCGCTGATCG TCCGGGTGTC CACGCCGGAC
TTCGCCGGCG GGCTGTCGGA CGGCTGGTCG GCCGCGCTGC GCTCGCTGGT CGCCACCGTC
CCCTTCGACC ACGCCGACTC GGACGAGTAC CCGGGTACGC GCTCGGGCTT CGGCGCCGGA
ACCGGTTCCG CGCCCGAGAC GGTCGCCGTG CTCGTCGGCC CGTCCCTGTC GGCCGCCGAC
CTCGACGAGC TCTGCGCGCT GATCCGTTCC TTCGGGATGG CGCCGGTGCT GGTCCCGGAT
CTCTCCGGCT CCCTCGACGG GCACCTGGCC CCGTCCTGGC AGCCGACGAC GACCGGTGGC
ACGGGGCTTG CGCAGCTGCG CCGCCTCGAC GAGGCCGGCC TGATCATCAC CGCCGGCGCG
ACCGCCGCGG AGGCCGGCGT CGACCTGGCC GCGCGCACCG CCGCCGACCT CGTCCAGCAC
GACCACCTCA GCGGCCTCGC CGCGGTGGAC AGCCTGGTCG CCGAGCTGAT GACCCGCTCG
GGACGCGGAC CGGCGCCCGA GGTGCGGCGG GCCCGCGCCC GGCTGGCGGA CGGCCTGCTC
GACACCCACT TCGTCCTCGG CGGGGCGCGG ATCGCGCTCG CGATGGAGCC CGAGGCGCTG
GTCGCCGTCG GCTCCCTGCT GCACGACGTC GGCGCGGAGA TCGTCGCGGC GGTGTCGCCG
ACGGACGCTC CCGTGCTCGC CACCGCCCCC TGGGACGAGA TCGTCATCGG CGACCTGACC
GACCTGGAGG AACGCGCCCT CGAAGGCGGC GCGGAACTGC TCATCGGGTC GAGTCACGTC
CGCACGGTCG CCGACCGTAT CGGCGCCGCC CACCTGGCCG TCGGATTCCC GATCTACGAC
CGGCTCGGAT CGGCCCTGCG CACGACCGCC GGGTACGGGG GCAGCCTGCG GCTGCTCGTC
GACGCGGCGA ACCGGCTGCT CGACCACCAC CAGGCGGACC ACCAGGCGAA CCACCGGGCC
GATCACCGCC CGGGGCGCCA CGACGTCCGC GAACATCCGC TCGACTCGTT CGACCAGCTC
GACGTTCTGT GCCAGGAGTC CCCATGTTGA
 
Protein sequence
MARVVTSDRR PGLDPLRFSQ PLGGALVFLG LAAAMPVMHG SKGCASFAKA LLTRHFNEPV 
PLQTTGVTEV SAVLGSGDDL VANLDGIRAK QNPRIIGLLT TGVTEVSGED VAGQVRQYIA
MMNHTTPEGA PLIVRVSTPD FAGGLSDGWS AALRSLVATV PFDHADSDEY PGTRSGFGAG
TGSAPETVAV LVGPSLSAAD LDELCALIRS FGMAPVLVPD LSGSLDGHLA PSWQPTTTGG
TGLAQLRRLD EAGLIITAGA TAAEAGVDLA ARTAADLVQH DHLSGLAAVD SLVAELMTRS
GRGPAPEVRR ARARLADGLL DTHFVLGGAR IALAMEPEAL VAVGSLLHDV GAEIVAAVSP
TDAPVLATAP WDEIVIGDLT DLEERALEGG AELLIGSSHV RTVADRIGAA HLAVGFPIYD
RLGSALRTTA GYGGSLRLLV DAANRLLDHH QADHQANHRA DHRPGRHDVR EHPLDSFDQL
DVLCQESPC