Gene Francci3_4485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4485 
Symbol 
ID3907461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5355035 
End bp5356405 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content69% 
IMG OID637881817 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifE 
Protein accessionYP_483560 
Protein GI86743160 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCA CAGAACGCGT CGCGCTGTTC ACCGAACCGG CCTGTAACCA CAACAACGCC 
AAGAGCACCA AGGCCCGCAA GGCGGGTTGT CCCAAACCCA AGCCCGGGGG CACTTCCGGA
GGCTGCGCCT TCGACGGAGC CATGATCACC CTCGTCCCGA TCGCCGACTG CGCTCATGTG
GTCCACGGCC CGATCGGCTG CGCGGGCAAC TCGTGGGACG GCCGAGGCAG CCTGTCGTCC
GGGCCGGTGC TCTACCGGCA CGGCTTCACC ACCGACATGG CCGAGAACGA CGTGGTGCTC
GGGGGGGAAC AGCGCCTGTT CGACACCATC CTGGAGGTCG TCGAGCGGCA CTCGCCGCCA
GCGGTCTTCG TCTATTCGAC CTGCGTCACC GCGATGATCG GAGACGATCT CGACGCGGTG
TGCGCGGCTG CCGCCGCCGA GTCCGGGGTG CCGGTGATCC CGATCCACTC GCCCGGGTTC
GTCGGCAGCA AGAACCTCGG CAACCGGCTC GCCGGCCAGG CCCTGCTCGA CCACGTGATC
GGCACCGTTG AGCCGGAGCT CACCACCGAC CGGGACGTCA ACCTGATCGG TGAGTACAAC
ATCGCCGGGG AGTTGTGGGA CGTCCTGCCG CTGCTGTCCC GGCTGGGCCT TCGGGTGCAG
GCGTGCATCA GCGGTGATGC CCGCTACCGT GACGTCGCCG CGGCCCATCG GGCCAGGGCG
ACGATGGTGG TCTGCTCGCG CGCGCTGCTC GGACTCGCCC GTGGCCTGCA GGAACGCTAC
GGGATCCCGT GGTTCGAGGG CAGCTTCTAC GGCACCCGGG CGATGGGCGA CACCCTGCGC
GGCTTCGCCC GCCTGCTCGA GGACGACGAG CTCTCCCGCC GCACGGAGGA GCTCATCGCT
GTCGAGGAGG CGGCCGTCGC CGTCGCACTT GAGCCGTACC GGCGCAGGCT CGCCGGGCGT
AAGGCGGTGC TATACACCGG CGGGGTCAAG AGCTGGTCGA TCGTGTCCGC GCTGCAGGAC
CTCGGCATCG AGGTGGTCGG CAGCGGCATT ACCAAGAGCT CCGACGGCGA CGTGGACAAG
ATCCGCGAGC TGCTCGGCGA CGCAAAAATG ATCAAAGAGG GCAGTCCGCG CGAGCTGCTG
CGGGTCGCCG AACAGACCGG GGCGGACATC CTCGTCGCCG GCGGCCGCAA CCAGTACACC
GCGCTGAAGG GCCGGCTGCC GTTCCTCGAT ATCAACCAGG AGCGGCATAT CCCTTACGCC
GGCTACACCG GGATGGTCGA GCTCGCCCGG CGCCTCGACC TGGCGATCGC GAGCCCGGTC
TGGGAGCAGG TCAGCCGGCC CGCTCCCTGG GACCTGGTCG GAGCCCGCTG A
 
Protein sequence
MAATERVALF TEPACNHNNA KSTKARKAGC PKPKPGGTSG GCAFDGAMIT LVPIADCAHV 
VHGPIGCAGN SWDGRGSLSS GPVLYRHGFT TDMAENDVVL GGEQRLFDTI LEVVERHSPP
AVFVYSTCVT AMIGDDLDAV CAAAAAESGV PVIPIHSPGF VGSKNLGNRL AGQALLDHVI
GTVEPELTTD RDVNLIGEYN IAGELWDVLP LLSRLGLRVQ ACISGDARYR DVAAAHRARA
TMVVCSRALL GLARGLQERY GIPWFEGSFY GTRAMGDTLR GFARLLEDDE LSRRTEELIA
VEEAAVAVAL EPYRRRLAGR KAVLYTGGVK SWSIVSALQD LGIEVVGSGI TKSSDGDVDK
IRELLGDAKM IKEGSPRELL RVAEQTGADI LVAGGRNQYT ALKGRLPFLD INQERHIPYA
GYTGMVELAR RLDLAIASPV WEQVSRPAPW DLVGAR