Gene Franean1_6879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6879 
Symbol 
ID5675192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8382943 
End bp8384499 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID641245728 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_001511119 
Protein GI158318611 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.640197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.320804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACGA CTCCCGAGAC AGGCAGTTCC ATTCCGTTGC GGGTCCTCGA CCACAGCGAG 
CTGTTCAAGG ACGAGGTCTA CCAGAAGCAG TTCGAGGGAA AGACCGAGTT CGAGAACGGC
AGTGACTCCG CCGAGGTTGC CCGCGTCCTC GAGTGGACCC GCGGCTGGGA GTACCGGGAG
AAGAACTTCG CCCGGGAGGC GCTGACCGTC AACCCGGCGA AGGCCTGCCA GCCGCTCGGT
GCGGTGCTCG CGGGCCTCGG GTTCCAGGGC ACGCTGCCGC TCGTGCACGG TTCGCAGGGC
TGCGTCGCGT ACTTCCGCAG CCACTTCGCT CGGCACTTCA AGGAGCCCGT CCCCGCGGCA
TCCACGTCGA TGACCGAGGA CGCGGCGGTC TTCGGCGGCC TGAACAACCT GGTCGAGGCG
CTGGAGAACG CGACCAGCCT GTACAAGCCG AAGATGGTCG CGGTCAGCAC CACCTGCATG
GCCGAGGTCA TCGGTGAGGA CCTCTTCGCC TACATCGGCG CGGCCAAGGA GAAGGAGGTG
ATCTCCACCG ACTACCCGGT TCCCTACGCC CACACCCCGA GCTTCGTGGG CTCGCACATC
ACCGGGTACG ACAGCATGCT CAAGGGAATC CTTGAGAACC TGACGAAGTC GGCGGACGCG
ACGGAGCCGA AGGCTGGTGG GAAGCCCCGG CTGAACATCA TCCCCGGTTT CGAGACCTAC
ACCGGTAACC TCCGCGAGTA CCGGCGCGTG CTCGAGCTCA TGGGCGTGGA CCCGCTGATC
CTCGGCGACC ACGCCGACTC GCTCGACTCG CCGGCCGACG GGGAGTACGA CCTCTACCCC
GGTGGCACGC CGCTGGCCGA GGCGGCGAAG GCGAAGTTCA GCCGCGCCAC CGTGCTGCTG
CAGGAGTCCG CCACCCGCAA GACCACCGAG CTGATCCGGG ACGTGTGGAA GCAGGACACG
CTGGTGCTGG AGACCCCGAT CGGGGTCCGC GGCACCGACC AGTTCCTGAC CGAGATCGCC
CGGCTGGCGG GCGTCGAGAT CCCGGCCGAG CTCACCGTCG AGCGCGGCCG TCTCGTTGAC
GCCCTGACGG ACTCGCACGC CTACCTCCAC GGCAAGAGGG TCGCCATCGC CGGCGACCCG
GACCTCGTCG TGGCGCTGAC CCGCTTCGTG CTCGAGCTCG GCATGATCCC GGTGCACGTG
CTCAGCACGA ACGCCGACAC CACCTTCAAG GCCCGCATGG AGAAGGTGCT CTCGGCGAGC
AAGTTCGGCG AGGCGGCCAC CGTCTGGCCG GAGAAGGACC TGTGGCACCT GCGGTCGCTG
GTCTTCACCG AGCCGGTCGA CCTGCTCATC GGCAGCACCT ACCTGAAGTA CATCTCCCGG
GAGGCGAACG TTCCGCTGGT GCGGGTCGGG TTCCCGATCT TCGACCGGCA CCACCTGCAC
CGCTTCCCGA TCGTCGGTTA CACCGGCGGG CTGCACCTGC TCACGCAGCT CGTGAACACC
GTGCTGGACG AGCTTGACCG GACCAGCCCG GACCATAGCT ACGACGCCGT GCGCTAG
 
Protein sequence
MTTTPETGSS IPLRVLDHSE LFKDEVYQKQ FEGKTEFENG SDSAEVARVL EWTRGWEYRE 
KNFAREALTV NPAKACQPLG AVLAGLGFQG TLPLVHGSQG CVAYFRSHFA RHFKEPVPAA
STSMTEDAAV FGGLNNLVEA LENATSLYKP KMVAVSTTCM AEVIGEDLFA YIGAAKEKEV
ISTDYPVPYA HTPSFVGSHI TGYDSMLKGI LENLTKSADA TEPKAGGKPR LNIIPGFETY
TGNLREYRRV LELMGVDPLI LGDHADSLDS PADGEYDLYP GGTPLAEAAK AKFSRATVLL
QESATRKTTE LIRDVWKQDT LVLETPIGVR GTDQFLTEIA RLAGVEIPAE LTVERGRLVD
ALTDSHAYLH GKRVAIAGDP DLVVALTRFV LELGMIPVHV LSTNADTTFK ARMEKVLSAS
KFGEAATVWP EKDLWHLRSL VFTEPVDLLI GSTYLKYISR EANVPLVRVG FPIFDRHHLH
RFPIVGYTGG LHLLTQLVNT VLDELDRTSP DHSYDAVR