Gene Francci3_4486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4486 
Symbol 
ID3907462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5356614 
End bp5358170 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID637881818 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_483561 
Protein GI86743161 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACGA CTCCGGAAAC AAACAGTTCC GTCCCGCTGC GGGTCCTCGA CCACAGCGAG 
CTGTTCAAGG ACGAGGTCTA CCAGAAGCAG TTCGAGGGCA AGCGCGAGTT CGAGAACGGC
AGCGACGCCG CCGAGGTCGC CCGGGTCCTG GAGTGGACCC GCGGCTGGGA GTACCGGGAG
AAGAACTTCG CCCGGGAGGC GCTGACCGTC AACCCGGCCA AGGCGTGCCA GCCGCTCGGC
GCGGTCCTGG CGGGCCTCGG CTTCGAGGGC ACCATGCCCC TGGTGCACGG CTCGCAGGGC
TGCGTCGCCT ACTTCCGCAG CCACTTCGCC CGGCACTTCA AGGAGCCCGT CCCGGCGGCA
TCCTCGTCGA TGACCGAAGA CGCCGCCGTC TTCGGCGGGC TGAACAACCT GGTCGAGGCG
CTCGAGAACG CGACCACCCT GTACAAGCCC AAGATGGTCG CGGTCAGCAC CACCTGCATG
GCCGAGGTGA TCGGGGAGGA CCTGTTCGCC TACATCGGCG CGGCGCGGGA CAAGGGCGCG
ATCTCCGCCG ACTACCCGGT GCCGTACGCG CACACGCCGA GCTTCGTGGG CTCGCACCTC
ACCGGCTACG ACAGCATGCT CAAGGGCATC CTGGAGCTCC TGAGCAGGAC AGCGGACGCC
GACAAGGTCG AGTCCCCCGG GAAGCCCCGG CTGAACATCG TGCCCGGCTT CGAGACCTAC
GTCGGCAACC ACCGCGAGTA CCGCCGCATC CTCGAGCTGA TGGGTGTCGA CCCCCTGATC
CTGGGCGACC ACAGCAGCTC CCTGGACTCG CCGGCGACGG GTGAGTACGA GCTCTACCCA
GGTGGTACCC CGCTCGCCGA GGCCGCCACG GCTCGGTTCA GCCGCGCTAC CGTGATGCTT
CAGGAGACCA CCACCCGCAA GACGACGGAG TTCGTCCGGG ACGGGTGGAA GCAGGAGACG
GTGGTGCTCG AGACCCCTAT CGGGGTGAAG AACACCGACC GCTTCCTGAC TGAGGTGGCC
CGCCTGGCGG AGGTCGAGAT CCCGGCCGAG CTCACCGCCG AGCGTGGCCG CCTGGTGGAC
GCGCTCACCG ACTCGCATGC CTATCTGCAC GGCAAGCGGG TCGCCATCGC GGGTGACCCT
GACCTCGTCA TCGCGATGAC CGGCTTCGCC CTCGAACTCG GGATGATCCC GGTCCACCTG
CTGAGCACGA ACGCCGACAA CTCCTTCGCG CCCAGGCTGG AGAAGGTCCT CTCGACTAGC
AAGTTCGGCG AGGCGGCGAC CGTCTGGCCG GGCAAGGACC TCTGGCACCT GCGGTCGCTG
GTCTTCACCG AGCCGGTCGA CCTGCTCATT GGGAGCTCGT ACCTGAAGTA CATCTCCCGG
GAGGCGAACG TGCCGCTGGT CCGGGTCGGT TTCCCGATCT TCGACCGGCA CCACCTGCAC
CGCTTCCCGA TCGTCGGCTA CACCGGCGGT CTGCACCTGC TCACCCAGCT CGTCAACACG
GTGCTCGACG AGTTGGACCG CACCAGCCCC GACCACAGCT TCGACGTCGT CCGCTGA
 
Protein sequence
MTTTPETNSS VPLRVLDHSE LFKDEVYQKQ FEGKREFENG SDAAEVARVL EWTRGWEYRE 
KNFAREALTV NPAKACQPLG AVLAGLGFEG TMPLVHGSQG CVAYFRSHFA RHFKEPVPAA
SSSMTEDAAV FGGLNNLVEA LENATTLYKP KMVAVSTTCM AEVIGEDLFA YIGAARDKGA
ISADYPVPYA HTPSFVGSHL TGYDSMLKGI LELLSRTADA DKVESPGKPR LNIVPGFETY
VGNHREYRRI LELMGVDPLI LGDHSSSLDS PATGEYELYP GGTPLAEAAT ARFSRATVML
QETTTRKTTE FVRDGWKQET VVLETPIGVK NTDRFLTEVA RLAEVEIPAE LTAERGRLVD
ALTDSHAYLH GKRVAIAGDP DLVIAMTGFA LELGMIPVHL LSTNADNSFA PRLEKVLSTS
KFGEAATVWP GKDLWHLRSL VFTEPVDLLI GSSYLKYISR EANVPLVRVG FPIFDRHHLH
RFPIVGYTGG LHLLTQLVNT VLDELDRTSP DHSFDVVR