Gene Information |
Locus tag | Francci3_4485 |
Symbol | |
ID | 3907461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5355035 |
End bp | 5356405 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637881817 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifE |
Protein accession | YP_483560 |
Protein GI | 86743160 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
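The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the gene length includes the stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5355035
end_bp = 5356405
gene_length_bp = 1371
protein_length_aa = 456


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Both checks pass for this record: 5356405 - 5355035 + 1 = 1371 bp, and 1371 / 3 - 1 = 456 aa.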
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCCA CAGAACGCGT CGCGCTGTTC ACCGAACCGG CCTGTAACCA CAACAACGCC AAGAGCACCA AGGCCCGCAA GGCGGGTTGT CCCAAACCCA AGCCCGGGGG CACTTCCGGA GGCTGCGCCT TCGACGGAGC CATGATCACC CTCGTCCCGA TCGCCGACTG CGCTCATGTG GTCCACGGCC CGATCGGCTG CGCGGGCAAC TCGTGGGACG GCCGAGGCAG CCTGTCGTCC GGGCCGGTGC TCTACCGGCA CGGCTTCACC ACCGACATGG CCGAGAACGA CGTGGTGCTC GGGGGGGAAC AGCGCCTGTT CGACACCATC CTGGAGGTCG TCGAGCGGCA CTCGCCGCCA GCGGTCTTCG TCTATTCGAC CTGCGTCACC GCGATGATCG GAGACGATCT CGACGCGGTG TGCGCGGCTG CCGCCGCCGA GTCCGGGGTG CCGGTGATCC CGATCCACTC GCCCGGGTTC GTCGGCAGCA AGAACCTCGG CAACCGGCTC GCCGGCCAGG CCCTGCTCGA CCACGTGATC GGCACCGTTG AGCCGGAGCT CACCACCGAC CGGGACGTCA ACCTGATCGG TGAGTACAAC ATCGCCGGGG AGTTGTGGGA CGTCCTGCCG CTGCTGTCCC GGCTGGGCCT TCGGGTGCAG GCGTGCATCA GCGGTGATGC CCGCTACCGT GACGTCGCCG CGGCCCATCG GGCCAGGGCG ACGATGGTGG TCTGCTCGCG CGCGCTGCTC GGACTCGCCC GTGGCCTGCA GGAACGCTAC GGGATCCCGT GGTTCGAGGG CAGCTTCTAC GGCACCCGGG CGATGGGCGA CACCCTGCGC GGCTTCGCCC GCCTGCTCGA GGACGACGAG CTCTCCCGCC GCACGGAGGA GCTCATCGCT GTCGAGGAGG CGGCCGTCGC CGTCGCACTT GAGCCGTACC GGCGCAGGCT CGCCGGGCGT AAGGCGGTGC TATACACCGG CGGGGTCAAG AGCTGGTCGA TCGTGTCCGC GCTGCAGGAC CTCGGCATCG AGGTGGTCGG CAGCGGCATT ACCAAGAGCT CCGACGGCGA CGTGGACAAG ATCCGCGAGC TGCTCGGCGA CGCAAAAATG ATCAAAGAGG GCAGTCCGCG CGAGCTGCTG CGGGTCGCCG AACAGACCGG GGCGGACATC CTCGTCGCCG GCGGCCGCAA CCAGTACACC GCGCTGAAGG GCCGGCTGCC GTTCCTCGAT ATCAACCAGG AGCGGCATAT CCCTTACGCC GGCTACACCG GGATGGTCGA GCTCGCCCGG CGCCTCGACC TGGCGATCGC GAGCCCGGTC TGGGAGCAGG TCAGCCGGCC CGCTCCCTGG GACCTGGTCG GAGCCCGCTG A
|
Protein sequence | MAATERVALF TEPACNHNNA KSTKARKAGC PKPKPGGTSG GCAFDGAMIT LVPIADCAHV VHGPIGCAGN SWDGRGSLSS GPVLYRHGFT TDMAENDVVL GGEQRLFDTI LEVVERHSPP AVFVYSTCVT AMIGDDLDAV CAAAAAESGV PVIPIHSPGF VGSKNLGNRL AGQALLDHVI GTVEPELTTD RDVNLIGEYN IAGELWDVLP LLSRLGLRVQ ACISGDARYR DVAAAHRARA TMVVCSRALL GLARGLQERY GIPWFEGSFY GTRAMGDTLR GFARLLEDDE LSRRTEELIA VEEAAVAVAL EPYRRRLAGR KAVLYTGGVK SWSIVSALQD LGIEVVGSGI TKSSDGDVDK IRELLGDAKM IKEGSPRELL RVAEQTGADI LVAGGRNQYT ALKGRLPFLD INQERHIPYA GYTGMVELAR RLDLAIASPV WEQVSRPAPW DLVGAR
|
| |
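The sequences above are printed in space-separated blocks of 10 characters. The sketch below is one way to strip that spacing, compute a GC fraction, and translate the gene sequence for comparison with the listed protein. It makes two assumptions. First, although the record gives the strand as "-", the gene sequence as displayed starts with ATG and ends with TGA, so it is treated as already being in coding orientation and no reverse complement is taken. Second, the amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons), so a standard codon table is used. The helper names are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments; alternative start codons
# are not handled here, which is fine for this gene because it starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove the block spacing and line breaks; return upper-case letters."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds):
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence, copied from the record.
head = clean("ATGGCAGCCA CAGAACGCGT CGCGCTGTTC")
tail = clean("GACCTGGTCG GAGCCCGCTG A")
print(translate(head))  # MAATERVALF
print(translate(tail))  # DLVGAR
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Passing the full gene sequence through clean() and gc_fraction() gives a value that can be compared with the 69% GC content listed in the record.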