Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_10981 |
Symbol | |
ID | 4717809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 938171 |
End bp | 939178 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640078813 |
Product | nitrogen regulation protein NifR3 family-like protein |
Protein accession | YP_001009489 |
Protein GI | 123968631 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family [TIGR00742] tRNA dihydrouridine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAA ATATAAGGCT AAAAGGAAGG GGAGTTAACA GAAAAATTAC GAGTAAGGTA ATGCTATCGC CATTAGCAGG AGTTACGGAT AACATTTTTA GACGACTTGT ACGTAAATGG GCTCCAAACT CTTTACTTTT TACAGAAATG ATAAATGCCA CAAGTCTTAA AAAAGGATAT GGCACACAAA AAATCAATCA AATAGATTTA GAAGAAGGTC CAATTGGAGT ACAAATATTT GATAATAGGC CATATGCTGT TTCTGAAGCT GCAAAACAAG CTGAGGACTC TGGAGCTTTC TTAATCGATA TAAATATGGG ATGTCCAGTA AAAAAAATTG CAAAGAAAGG TGGAGGCAGT GCCTTAATTA AAGACCGAAA ACTTGCTATA GAATTAGTCA AAAATGTTGT AAAAGCTGTT AGGGTTCCTG TAACAGTAAA AACACGACTC GGATGGGATA GTAAAGAAGA AAATATAGAG GATTTCTTAT TTAAACTTCA AGATGCGGGA GCAACCATGA TCACACTTCA TGGAAGAACT AGAAAACAGG GTTTTTCAGG CAAGTCAGAT TGGGAAATGA TCGGGAGACT TAAAAAGTTG TTGGAAATTC CAGTAATTGC TAATGGAGAT ATCAAAAATC CAGATGACGC TCTTAATTGT TTGAAAAAAA CAAAAGCTGA TGGTGTAATG ATTGGACGAG GAATTTTAGG ATCCCCATGG AAAATAGGAG AAATAGATTA TGCTCTTAGA GAAAATAAAA ATTTTAAAGA ACCAAACACA GAAGAAAAAC TATATTTAAT TATTGAGCAT CTTGATGAAT TAATAAAAGA AAAAGGAGAT CACGGTTTGC TAATCGCAAG GAAACATATC TCATGGACAT GCAAAGACTT TAAAGGGGCA TCAAATTTGA GAAATAACTT GGTTAGAGCT GTTGATAAAA ATGAAGTTAA AAATTTAATA AATAAAATGA TTCAAACTTT GAATAATGAA AAAAATAGAT TAGCTTAA
|
Protein sequence | MSSNIRLKGR GVNRKITSKV MLSPLAGVTD NIFRRLVRKW APNSLLFTEM INATSLKKGY GTQKINQIDL EEGPIGVQIF DNRPYAVSEA AKQAEDSGAF LIDINMGCPV KKIAKKGGGS ALIKDRKLAI ELVKNVVKAV RVPVTVKTRL GWDSKEENIE DFLFKLQDAG ATMITLHGRT RKQGFSGKSD WEMIGRLKKL LEIPVIANGD IKNPDDALNC LKKTKADGVM IGRGILGSPW KIGEIDYALR ENKNFKEPNT EEKLYLIIEH LDELIKEKGD HGLLIARKHI SWTCKDFKGA SNLRNNLVRA VDKNEVKNLI NKMIQTLNNE KNRLA
|
| |