Gene Franean1_6878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6878 
Symbol 
ID5675191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8381242 
End bp8382615 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content72% 
IMG OID641245727 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifE 
Protein accessionYP_001511118 
Protein GI158318610 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.785272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.310406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCA CTGACCGTGC AACGCTATTC ACCGAACCGG CATGTGATCA CAACCGCGAG 
AAGTCCGCCA AGGAGCGCAA GGCCGGGTGC CCCAAGCCCG CGCCCGGCGG CACCAGCGGC
GGATGCACCT TCGACGGTGC GATGATCACA CTGGTTCCGA TCGTCGACAG CGCCCACGTC
GTGCACGGCC CGATCGCCTG CGCAGGAAAC TCCTGGGACG GGCGGGGGAG CCTCTCCTCC
GGCCCCGACC TCTACCGCCG CGGCTTCACC AGCGACGTCG GTGAGCAGGA CGTCATCTTC
GGCGGCGAGC AGCGCCTGTT CGACACCATT CTCGAAGCCG TCCGCCGGCA CCACCCGCCG
GCGGTGTTCG TCTACTCGAC CTGCGTCACC GCCATGATCG GCGACGACCT CGACGCCGTC
TGCTCGGCGG CCGCCGAGCA CACCGGCGTC CCGGTCATCC CGGTGCACGC TCCGGGGTTC
GCCGGCAACA AGAACCTCGG CAACCGCCTC GCCGGCGAGG CCCTGCTCGA GCACGTCATC
GGGACGGTCG AGCCGCCCGA CGTGACCGAG CTCGACGTGA ACCTGGTCGG CGAGTACAAC
ATCGCCGGCG AGCTGTGGGA CGTCCTGCCG GTGCTGGCCA AGATGGGCAT CCGGGTCCGG
GCCTGCATCA GCGGCGACGC GCGCTACGCG GACGTCGCCG CGGCGCACCG GGCCCGCGCG
ACGATGGTCG TCTGCTCCCG GGCGCTGCTG GGCCTGGCCC GCGGCCTGGA GGATCGCTAC
GGCATCCCCT GGTTCGAGGG CAGCTTCTAC GGCGTCCGCG CGATGAACGA CACCCTGCGC
GAGTTCGCCC GCCTGCTCGG CGGCGCGGAG CTGGCCCGGC GCGCCGAGGA GGTCATCGCC
CTCGAGCAGA CCGCCGTCGA CCTGGCCCTG GAGCCCTACC GCGAGCGGCT CGCCGGCAGG
CGCGCGGTGC TCTACACCGG CGGGGTCAAG AGCTGGTCGA TCGTCTCCGC GCTGCAGGAC
CTCGGCATCG AGGTCGTCGC GAACGGCATC ACCAAGAGCT CCGACGGCGA CGTCGAGAAG
ATCCGCGAAC TGCTCGGCCC GGACGCCAGG ATCGTCTCCG AGGGCAGCCC GCGCGAGCTG
CTGCGCATCG CCGAGGAGAC CCGCGCGGAC ATCCTCGTCG CCGGCGGCCG CAACCAGTAC
ACGGCGCTCA AGGGCCGGCT GCCGTTCCTC GACATCAACC AGGAGCGGCA CATCCCCTAC
GCGGGCTACC GCGGCGCGGT CGAGCTGGCC CGCCGCCTCG ACATGGCGCT GTCGAACCCG
GTGTGGGAGC AGGTCCGCGC CCCCGCGCCC TGGGACGTCG AGGGGGTGGC GTGA
 
Protein sequence
MAATDRATLF TEPACDHNRE KSAKERKAGC PKPAPGGTSG GCTFDGAMIT LVPIVDSAHV 
VHGPIACAGN SWDGRGSLSS GPDLYRRGFT SDVGEQDVIF GGEQRLFDTI LEAVRRHHPP
AVFVYSTCVT AMIGDDLDAV CSAAAEHTGV PVIPVHAPGF AGNKNLGNRL AGEALLEHVI
GTVEPPDVTE LDVNLVGEYN IAGELWDVLP VLAKMGIRVR ACISGDARYA DVAAAHRARA
TMVVCSRALL GLARGLEDRY GIPWFEGSFY GVRAMNDTLR EFARLLGGAE LARRAEEVIA
LEQTAVDLAL EPYRERLAGR RAVLYTGGVK SWSIVSALQD LGIEVVANGI TKSSDGDVEK
IRELLGPDAR IVSEGSPREL LRIAEETRAD ILVAGGRNQY TALKGRLPFL DINQERHIPY
AGYRGAVELA RRLDMALSNP VWEQVRAPAP WDVEGVA