Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4138 |
Symbol | |
ID | 4245652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6383569 |
End bp | 6385107 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638109039 |
Product | nitrogenase molybdenum-iron protein beta chain |
Protein accession | YP_723619 |
Protein GI | 113477558 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01286] nitrogenase molybdenum-iron protein beta chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.453924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGA ATGTAGACAA AATTAAAGAT CACTTTCAAC TTTTCCAAGA GCCAGAATAC CAAGAAATGT TCGCTCGGAA AAGAGAATTT GAAGGCGGTG CTTCCAAAGA AGAAATAGAA AGAGTTCGTG AGTGGACAAA AAGTTGGGAA TATCGTGAGA AGAACTTTGC TCGTGAGGCT CTAACTATCA ACCCTGCTAA AGCTTGTCAG CCTTTAGGTG CAATATTTGC AGCTGCAGGT TTTGAAGGAA CTCTTCCTTT TGTACATGGT TCTCAAGGAT GTGTTGCTTA CTTCCGTTCT CACTTAACTC GTAACTACAA AGAACCATTC CAAGCGGTTT CCTCTTCTAT GACTGAAGAT GCTGCTGTAT TTGGTGGTCT GAAAAATATG ATTGATGGTT TGGCAAACTC TTATGCTTTG TACAAGCCTA AAATGATTGC TCTTTGCACC ACTTGTATGG CAGAGGTAAT TGGAGATGAC TTGGGTTCAT TCATTACCAA CTCCAAAAAT GAAGGTGCAG TACCTCAAGA TTTCCCAGTT CCTTTTGCTC ACACTCCTAG CTTTGTTGGT TCTCATATCA CAGGCTATGA CAATATGCTC AAGGGTATCC TAATAGCTCT TACTGACGGT AAGAAGACAG AAACTGATAA TGGAAAAATC AACTTTATCC CTGGTTTCGA CCCTTACATT GGCAACATCC GGGATTTAAA GAATATTCTG TCTTTAATGG ATGTTCCTAG CACTGTTTTA GCTGACAACG CTGAGAGTTT TGATTCTCCT AACTTGGGTG AATTCAAGAT GTACAATGGT GGTACAACTC TAGAAGAAGC GGGTGATTCC ATCAATGCTA AAGCTACTAT TTCCTTCCAA AAATACAGCA CTCCTAAGAC TCTAGAGTAC CTGAAACAAG AAGGTGGTCA AAAAACAGCT ACATACCGCC CTATTGGTGT TCGTGGTACA GATGAGTTCT TAATGGCTTT GTCTGAATTG ACTGGTAAGG CTATTCCTGA AGAGTTAGAA ATTGAGCGTG GTCGTGTAGT TGATGCTATC ACTGACTCTC AAGCTTGGTT GCACGGTAAG CGTATTGCTA TCTACGGTGA TCCTGACCAT GTATTGGGCT TGTTGAATTT CACTCTAGAA TTAGGTATGC AACCAGTTCA CGTTGTTGTA AATAACGGTA ACGTTGCTGG TTTTGAAGAA GAAGCTAAGG AATTGTTAGC TAATGATCCT AATGGCAAAG AAGCTACAGT TTGGATCGGT AAGGACTTAT GGCACTTACG TTCATTGTTG GATACTGAGC CAGTTGATTT GTTAATTGGT AACTCATACG GTAAGTTCCT ACAACGTGAC ACTGGTACTC CATTAGTACG TATTGGCTAT CCTATTTTCG ACCGCCATCA CCAACACCGT TATTCTATCT TAGGATATAA GGGAGCATTC AACCTCATCA ACTGGATCGT TAATACTATC CTTGATGAAT TAGACCGTGG TAGCATGGAT CTAGGTGTTA ACGATACATC TTTTGACTTG GTTCGTTAA
|
Protein sequence | MSQNVDKIKD HFQLFQEPEY QEMFARKREF EGGASKEEIE RVREWTKSWE YREKNFAREA LTINPAKACQ PLGAIFAAAG FEGTLPFVHG SQGCVAYFRS HLTRNYKEPF QAVSSSMTED AAVFGGLKNM IDGLANSYAL YKPKMIALCT TCMAEVIGDD LGSFITNSKN EGAVPQDFPV PFAHTPSFVG SHITGYDNML KGILIALTDG KKTETDNGKI NFIPGFDPYI GNIRDLKNIL SLMDVPSTVL ADNAESFDSP NLGEFKMYNG GTTLEEAGDS INAKATISFQ KYSTPKTLEY LKQEGGQKTA TYRPIGVRGT DEFLMALSEL TGKAIPEELE IERGRVVDAI TDSQAWLHGK RIAIYGDPDH VLGLLNFTLE LGMQPVHVVV NNGNVAGFEE EAKELLANDP NGKEATVWIG KDLWHLRSLL DTEPVDLLIG NSYGKFLQRD TGTPLVRIGY PIFDRHHQHR YSILGYKGAF NLINWIVNTI LDELDRGSMD LGVNDTSFDL VR
|
| |