Gene Information |
Locus tag | Tery_4139 |
Symbol | |
ID | 4245653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6385521 |
End bp | 6388331 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638109040 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | YP_723620 |
Protein GI | 113477559 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE; [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
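The coordinate and length fields in the table above can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 6385521
end_bp = 6388331
reported_gene_length = 2811      # bp
reported_protein_length = 936    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.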
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.820547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATGA TAACCCCAGG CAAAGTTGCC GAACTTTTAA ATGAGCCAGG TTGCGAACAC AACCATCAAA AAAATAACGG AGATAAAAAA CAGAAAGGCT GCCAGCAACA AGCAGCACCC GGAGCTGCTC AAGGGGGTTG TGCCTTTGAT GGTGCTAGTA TTGCTCTAGT TCCAATTACA GATGCAGCTC ACCTGGTTCA CGGTCCTTTA GGTTGTTCTG GTAACTCCTG GGGAGCTCGT GGAAGTCTTT CCTCCAGTTC CCATCTATAC AAAATGGGTT TCACCACTGA TATGGGTGAA AATGACATTA TTATGGGCGG AGAGAAAAAA TTATTGAGGG CGATCGCTGA ATTAAAAAAA CGTTATCAAC CTCCTGCTAT ATTTGTCTAT GCCACCTGTG TTACTGCTTT AATTGGAGAT GACCTAGAAA CAGTTTGTAA AGTCGCTACT AAAAAATTAG AAATTCCGGT CATTCCCGTT AACTCTCCTG GTTTCATCGG TAGCAAAAAT CTGGGTAACC GGGTTGGCGC AGAAGCATTA TTAGACCATG TAGTAGGTAC GGCAGAACCT GAATTTACTA CACCTTATGA CATAAACCTA ATTGGGGAAT ATAATATTGC TGGGGAAATG TGGGCAATGT TACCCCTGTT TGAAAAAGTT GGCATTCGGG TATTGTCAAA AATCACGGGA GATGCTACTT ACAAAGAAGT TTGTTATGCC CACCGTGCCA AACTCAACGT TATGATCTGC TCTAAAGCTA TGATCAATAT GGCACGAAAA ATGGAGGAAG AATATGGCAT TCCTTACATT GAAGAATCAT TTTATGGTGT TGCTGATATT AATAACTGCC TCCGGAATAT TGCTGCCAAA ATTGGGGATG CTGGCCTGCA AGAACGGACA GAAAAACTGA TCGCTGAAGA AACAACTATT CTTGAAGAAA TATTAGAACC CTATCGCCAA CGTCTAAAAG GTAAAAAAGT TGTACTCTAC ACAGGTGGGG TGAAAAGTTG GTCTATTATT TCGGCAGCTA AAGATTTAGA GATGGATGTG GTTGCTACTA GCACCAAGAA AAGTACAGAA GAGGATAAGG CTAAAATCAA AAAGTTGCTC GGTAAAGATG GTATTTTGCT AGAAAAAGGT AATGCCGAAA TACTCTTAAA AGTAATTGCC GAGACTAAAG CTGATATGCT TATAGCAGGT GGTCGTAACC AGTATACTGC TCTCAAAGCC CGAATTCCCT TTTTACACAT TAACCAAGAA CGTCATCATC CCTACGCCGG ATATCATGGG ATGATAGAAA TGGCTAAAGA ATTAGATGAA GCTCTCTATA GTCCGGTTTG GGAACAGGTG AGACAACCGG CACCTTGGTT AGAGGCATGT CAGTTAGATG ATGTTTTGGC TGTTGAAACT TTACCAAGTT TAACTAATAT TCCACCAACA ACAGTCAATT TTCATAAACA ATCATTATCC ACAAATCCTC TGAAACTCAG TCAACCTTTG GGTGCGGCTT TAGCATACTT AGGAATTAAT GGGATGATGC CAATGTTCCA CGGTACTCAA GGTTGTACTG CTTTTGCTAA GGTATTATTG GTGAATCACT TCCAGGAAGC TATTCCCTTG TCTACTACTG CTATGAGTGA AGTGACAACT ATTTTGGGAG GGGAGGATAA CATTGATAAA GCGCTGCTGA CTCAGCTTGA GAAGTCAAAA CCAAAGGTAA TTGGTTTGTT GACTACTGGT TTAACTGAAA CCAGAGGGGA TGACATGGAG CGTATTCTTA AGAAATTTAG GGAAGAGCAT 
CCAGAATTAG ATGGGTTCCC CATATTAAAT GTTTCTAGTC CAGATTATAA GGGTTCTGCT CAGGACGGTT TTGCTACTAC AGTAGAATGT ATAGTAGGTT ATGATTATGG GGAGCCTATT CCCAAAGAAA TTAAAAAACC TTTGATTACA ATTTTAGCTG GTTCTTGTCT TGCTCCTGGA GATGTCCAGG AAATCAAGGA TATTGTAGAA GATTTTGGGT TCATTCCAAT AGTTGTACCG GACTTATCTC AGTCTTTGGA CGGGCATTTA ATTGATGATA TTTATAGTGC TACAAGCTCA GGTGGTACCA CAATAGAGGA TTTGCGTAAT TTACGTCACT CATCTTTCAC CTTTGCTATT GGGGAAAGTA TGCGAAATGC AGCTATAATT TTGCAAGAAA AATTTGGTAC TCAATATCAA GTGTTCTCTC GGTTGACTGG TTTGGGTGCT GTAGATAGTT TCATGTTAAA ATTGTCTCAG CTAATTGTAT CTCGAATTGA TCCTCATCTT GACAAAGGCT GTGAAGTTCC TGAGAAATAC CTACGCCAAC GCCGTCAGTT ACAGGATGCG ATGTTGGATA CTCACTTCTA TTTTGGGCAT AAGCAAGTAT CTATTGCCCT GGAACCAGAT TTACTTTGGG CAACAAGTTG GTTTGTGCGA GAAATGGGGG GAGATATTCA TTCTGCTGTC ACAACTACGC GATCGCCGTT ACTTGAAAAA TTACCTACGG AAAATGTGAT AGTTGGGGAC TTGGGAGATT TAGAAGAAGT GGCAGCAGGT TCGGATTTAC TAATTACCAA TTCTCGTGGT AAGATAATAT CTGAGAAGTT AAATATTGAT CTCTATCGGA TGGGAATGCC AATTTACGAT CGCCTTGGTA ATGGTCAACG TTGTTCTGTT GGTTACCGTG GTACAATGAA TTTATTATTT GATATTGGTA ATATTTTCCT AGAGCAAGAG GAATCAAAAA TTCACACCAA TGATTATTCA TTATTAAGTA CTCAGGCATA A
|
Protein sequence | MAMITPGKVA ELLNEPGCEH NHQKNNGDKK QKGCQQQAAP GAAQGGCAFD GASIALVPIT DAAHLVHGPL GCSGNSWGAR GSLSSSSHLY KMGFTTDMGE NDIIMGGEKK LLRAIAELKK RYQPPAIFVY ATCVTALIGD DLETVCKVAT KKLEIPVIPV NSPGFIGSKN LGNRVGAEAL LDHVVGTAEP EFTTPYDINL IGEYNIAGEM WAMLPLFEKV GIRVLSKITG DATYKEVCYA HRAKLNVMIC SKAMINMARK MEEEYGIPYI EESFYGVADI NNCLRNIAAK IGDAGLQERT EKLIAEETTI LEEILEPYRQ RLKGKKVVLY TGGVKSWSII SAAKDLEMDV VATSTKKSTE EDKAKIKKLL GKDGILLEKG NAEILLKVIA ETKADMLIAG GRNQYTALKA RIPFLHINQE RHHPYAGYHG MIEMAKELDE ALYSPVWEQV RQPAPWLEAC QLDDVLAVET LPSLTNIPPT TVNFHKQSLS TNPLKLSQPL GAALAYLGIN GMMPMFHGTQ GCTAFAKVLL VNHFQEAIPL STTAMSEVTT ILGGEDNIDK ALLTQLEKSK PKVIGLLTTG LTETRGDDME RILKKFREEH PELDGFPILN VSSPDYKGSA QDGFATTVEC IVGYDYGEPI PKEIKKPLIT ILAGSCLAPG DVQEIKDIVE DFGFIPIVVP DLSQSLDGHL IDDIYSATSS GGTTIEDLRN LRHSSFTFAI GESMRNAAII LQEKFGTQYQ VFSRLTGLGA VDSFMLKLSQ LIVSRIDPHL DKGCEVPEKY LRQRRQLQDA MLDTHFYFGH KQVSIALEPD LLWATSWFVR EMGGDIHSAV TTTRSPLLEK LPTENVIVGD LGDLEEVAAG SDLLITNSRG KIISEKLNID LYRMGMPIYD RLGNGQRCSV GYRGTMNLLF DIGNIFLEQE ESKIHTNDYS LLSTQA
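The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the code below strips the spacing, computes a GC fraction, and translates codons. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, and only the first three blocks of the gene sequence are used as input.

```python
# Codon table built from the conventional T, C, A, G ordering.
# Amino-acid assignments here are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the Gene sequence field (30 nt).
prefix = "ATGGCTATGA TAACCCCAGG CAAAGTTGCC"
print(translate(prefix))   # MAMITPGKVA
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the Protein sequence field. Applying `gc_fraction` to the full gene sequence would be the way to compare against the reported GC content; the value for this 30 nt prefix is not representative of the whole gene.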