Gene Information |
Locus tag | P9303_21571 |
Symbol | |
ID | 4777449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1916505 |
End bp | 1917515 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640087665 |
Product | putative nitrogen regulation protein NifR3 family protein |
Protein accession | YP_001018157 |
Protein GI | 124023850 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
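The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1916505, 1917515)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.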
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0855264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTGAAC TCAGCCTGCC CGGCAGGGGA ACAACAAGGG CACTGCGCTG CAGAGTGCTG CAATCGCCCC TTGCAGGCGT GAGTGATCAG ATCTTCCGAA GCCTGGTTCG GCGCTGGGCA CCAGATGCAT TGTTATTCAC GGAAATGGTC AATGCCACCA GCCTGGAGCT GGGCCACGGC CTACAGAAAA TCGACGAACT CGCCAATGAA GCCGGGCCCA TTGGCGTGCA ATTGTTCGAT CACCGCCCAG AAGCGATGGC CGATGCTGCA CAGCGAGCAG AGGCTGCTGG TGCCTTCCTG ATCGACATCA ACATGGGCTG TCCTGTGCGC AAGATTGCAC GCAAGGGCGG TGGCTCTGGT CTGATCCGTG ACCCACAACT AGCGGCAAAA ATCGTGAGCA CTGTTGCTGC AGCAGTCAAA ATCCCAGTCA CCGTGAAGAC AAGGCTGGGC TGGTGTGGCA GCGATGCAAG CCCTGTGCGC TGGTGTCAAT GGCTCGAGCA GGCCGGCGCC CAGATGTTGA CATTGCATGC TCGAACTCGA GAGCAAGGCT TCAAAGGCTC AGCTGATTGG CTCGCCATCG CTGCCGTCAA AAGCGCACTG CAAATCCCCG TGATTGCCAA TGGCGATGTC AAGAGCGACA TGGATGCCAA GCGCTGCCTG GCGATCACTG GAGCCGATGG CGTGATGGTG GGCAGAGGCT CGATGGGAGC GCCATGGCTT GTGGGTCAAA TCGATGCAGC ACTATCCGGC ATACCCGTGC CTGCAACGCC TGGAGCGGCA GAGCGACTCA CCATTGCTCG TGAACAACTT GAAGCGCTGG TACAAGCAAA GGGGGAACAT GGACTCTTGA TTGCCCGTAA ACACATGGGA TGGACTTGTA GCGGCTTCGT CGGCGCGTCG AAACTGCGCC ATGCCCTGAT GCGTGCACCA ACGCCAACGG ATGCCATATC ACTGTTAGAG CAAGCCAGCG CAGAACTCAT AACGGCATGG CCTGAAGCGA CCAACGCCTA A
|
Protein sequence | MPELSLPGRG TTRALRCRVL QSPLAGVSDQ IFRSLVRRWA PDALLFTEMV NATSLELGHG LQKIDELANE AGPIGVQLFD HRPEAMADAA QRAEAAGAFL IDINMGCPVR KIARKGGGSG LIRDPQLAAK IVSTVAAAVK IPVTVKTRLG WCGSDASPVR WCQWLEQAGA QMLTLHARTR EQGFKGSADW LAIAAVKSAL QIPVIANGDV KSDMDAKRCL AITGADGVMV GRGSMGAPWL VGQIDAALSG IPVPATPGAA ERLTIAREQL EALVQAKGEH GLLIARKHMG WTCSGFVGAS KLRHALMRAP TPTDAISLLE QASAELITAW PEATNA
|
| |
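The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where some non-ATG codons can act as initiators and are read as methionine. The following is a minimal sketch of that relationship and of a GC-content calculation, not the database's own pipeline. The helper names are hypothetical, and the set of start codons is an assumption limited to the most common bacterial initiators.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may serve as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiator codons are listed here.
START_CODONS = ("ATG", "GTG", "TTG")


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("GTGCCTGAAC TCAGCCTGCC CGGCAGGGGA"))  # MPELSLPGRG
```

Passing the full Gene sequence field to `translate` and `gc_content` gives values to compare against the Protein sequence and GC content fields.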