Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4137 |
Symbol | |
ID | 4245651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6381887 |
End bp | 6383344 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638109038 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_723618 |
Protein GI | 113477557 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.303688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCTG AAAAGATTGA ACAGAATAAA CAGTTAATTC AAGAAGTCTT AGACGCTTAT CCCGCGAAAG CTGCTAAGAG ACGGAAAAAG CACCTTAACG TAATCGAAGA AAAAGGAGCT GACTGTGGCG TTAAGTCTAA CGTAAAATCA GTTCCTGGTG TAATGACAAC TCGTGGTTGT GCATTTGCTG GAGCGAAAGG TGTGGTTTGG GGTCCTGTTA AGGACATGGT TCACATTAGT CACGGTCCTG TTGGTTGCGG TTACTACTCT TGGGCAGGTC GTCGTAACTA CTATAACGGT GTAACTGGTG TTGATACTTT CGGTACAATG CAATTCACCT CAGATTTCCA AGAGAGAGAT ATTGTTTTTG GTGGAGACAA AAAGCTCGCC AAAATTATGA ACGAAATTGA AGAGTTATTC CCTCTGAATG CTGGTATCAC AATTGAATCT GAATGTCCAG TAGGTCTAAT TGGTGATGAC ATTGAAGCGG TAGCGAAAAA AGCTAGCAAA GAACTCAATA AGCCAGTTGT ACCAGTACGT TGCGAAGGTT TCCGTGGTGT TTCTCAGTCA TTAGGTCACC ACATTGCTAA CGACACAGTG CGTGACTGGG TATACGAACC TTCTGCTAAA GTTACTAACG AAGAAATTGG TTTTGAGAAG ACTCCTTATG ACGTATCCTT AATTGCTGAT TACAACATCG GTGGTGACGG TTGGAGTTCT CGTTTGTTAT TAGATGAAAT TGGCTTAAGA GTTGTTAGCC AAGCAACAGG TGACGGTACT TATAACGAAG TATTCATGGC TCCTAGGGTG AACTTAAACC TCATCCACTG CTATCGTTCT ATGAACTATA TCTGCCGTTA CATGGAAGAA GAGTATGGTA TACCTTGGGT TGAGTTCAAC TTCTTCGGTC CTAGTCAAAT TGCTAAGTCT CTCCGGAAGA TTGCTTCTTT CTTTGATGAC AAAATCAAGG AAAACACAGA AAAAGTAATT GCTAGATATC AAGAACAAGC TGATGCAGTA ATTGCTAAGT ATCGTCCTCG TTTAGAAGGC AAGAAAGTAA TGATGATGGT TGGTGGTCTC CGTCCACGTC ACATTATTCC TGCTTTTGAC GATTTAGGAA TGGAAGTTAT TGGTACTGGT TATGAATTTG GTCACGGTGA CGACTACAAG CGTACTGCTG ACTATGCTCA AGAAGGTACT CTAATCTATG ATGACGTTAG TGGCTACGAA TTTGAAGAAT TTGCTAAGAA ATTAAAGCCA GATTTAATTG CTTCTGGTAT TAAAGAGAAG TATGTTTTCC AGAAGATGGG TATGCCATTC CGTCAAATGC ACTCTTGGGA TTATTCTGGT CCTTATCACG GTTATGACGG ATTCGCTATC TTCGCTCGTG ACATGGATCT AGCTCTCAAT AGCCCAACTT GGAACTTAAT CAAAGCTCCT TGGAAGCAAG CTAAGTAG
|
Protein sequence | MASEKIEQNK QLIQEVLDAY PAKAAKRRKK HLNVIEEKGA DCGVKSNVKS VPGVMTTRGC AFAGAKGVVW GPVKDMVHIS HGPVGCGYYS WAGRRNYYNG VTGVDTFGTM QFTSDFQERD IVFGGDKKLA KIMNEIEELF PLNAGITIES ECPVGLIGDD IEAVAKKASK ELNKPVVPVR CEGFRGVSQS LGHHIANDTV RDWVYEPSAK VTNEEIGFEK TPYDVSLIAD YNIGGDGWSS RLLLDEIGLR VVSQATGDGT YNEVFMAPRV NLNLIHCYRS MNYICRYMEE EYGIPWVEFN FFGPSQIAKS LRKIASFFDD KIKENTEKVI ARYQEQADAV IAKYRPRLEG KKVMMMVGGL RPRHIIPAFD DLGMEVIGTG YEFGHGDDYK RTADYAQEGT LIYDDVSGYE FEEFAKKLKP DLIASGIKEK YVFQKMGMPF RQMHSWDYSG PYHGYDGFAI FARDMDLALN SPTWNLIKAP WKQAK
|
| |