Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4135 |
Symbol | |
ID | 4245649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6379583 |
End bp | 6380458 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638109036 |
Product | Fe-S cluster assembly protein NifU |
Protein accession | YP_723616 |
Protein GI | 113477555 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0694] Thioredoxin-like proteins and domains [COG0822] NifU homolog involved in Fe-S cluster formation |
TIGRFAM ID | [TIGR02000] Fe-S cluster assembly protein NifU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000225432 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGAAT ATACTGAAAA GGTAATGGAT TTGTTCTATA ACCCCCAAAA CCAAGGAACT ATTACAGACA AAAAAGAGGG GGAAAAGATA GTTAGTGGTG AAGTAGGAAG TATAGCCTGT GGAGATGCTT TAAGTCTACA CCTCAAAGTA AATGAAGCCT CCGGTGAAAT ATTAGATGCC AAATTTCAAA CCTTTGGCTG TGCAAGTGCG ATAGCTTCAT CTTCTGCCCT AACAGCAATG CTCAAAGGCA AAACCATAGA CGAAGCCATG AATATTAAAA ACCAGGATAT TGCCGGATAC CTGGGAGGGC TGCCAGAAGA AAAAATGCAC TGTTCAGTCA TGGGAGAGGA AGCATTAGAA GCGGCAATAT TTAAGTACAA AGGTATTGAA GTAGAAGTTC ACGAAGAAGA CGACGAAGGA TCATTAGTTT GTAGTTGTTT TGCGATAACA GAAAACAAGA TTAAGCGAGT TATTTTGGAA AACAATCTCA AAACAGCGGA AGAAGTAACA AACTATGTCA AAGCTGGTGG TGGTTGTGGT TCTTGTCTGG CAGATATTGA TGATCTCGTC GCATCGGTTT ATGAAGCGCC AGACACTACA ACGCAACAAA TTCCTACAAC TACTAAACCA GCAACCAACC TGACAAACTT GCAAAAAATT ACATTAATTC AGCAAGTATT ACAACAAGAG GTGAGGCCAG TTCTCGCCGA AGATGGAGGA GATGTTGAGT TATTCGATGT AGATGGCGAT CGCGTGCTAG TCAAACTCAA AGGAGCTTGT GGTTCTTGCA GTAATGTGCT AGTAACGCTA AAAGGAGCGA TCGAAGCTAC ATTAAAAGAA CGAGTTAGTG AAAGTCTTGT AGTAGAAGCG GTATAA
|
Protein sequence | MWEYTEKVMD LFYNPQNQGT ITDKKEGEKI VSGEVGSIAC GDALSLHLKV NEASGEILDA KFQTFGCASA IASSSALTAM LKGKTIDEAM NIKNQDIAGY LGGLPEEKMH CSVMGEEALE AAIFKYKGIE VEVHEEDDEG SLVCSCFAIT ENKIKRVILE NNLKTAEEVT NYVKAGGGCG SCLADIDDLV ASVYEAPDTT TQQIPTTTKP ATNLTNLQKI TLIQQVLQQE VRPVLAEDGG DVELFDVDGD RVLVKLKGAC GSCSNVLVTL KGAIEATLKE RVSESLVVEA V
|
| |