Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_22881 |
Symbol | |
ID | 4778706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2019586 |
End bp | 2020794 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087807 |
Product | hypothetical protein |
Protein accession | YP_001018288 |
Protein GI | 124023981 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2109] ATP:corrinoid adenosyltransferase |
TIGRFAM ID | [TIGR00708] cob(I)alamin adenosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.730155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCGC AGACGGCAGC ATCCAAGACA CCCAGAAATC GCCAAACATC AAGCGGCAGA AGAGGACGTG GGATTGGCAT CGCGACCGCA GCCGAAAGCA ATGAACGCAG TCATGGCCAA CTGCACATCT ATGACGGCGA GGGAAAAGGT AAAAGTCAGG CCGCTCTTGG TGTTGTCCTA CGCACCATCG GGCTCGGGAT CTGCGAGCAA CGACGAACCA GGGTGCTGTT GCTTCGATTC CTCAAAGGCC CAGGCCGCGC CTACGACGAA GATGCTGCTA TTGAGGCACT GCAACAGGGC TTCCCTCACC TAATCGACCA AGTGCGCACA GGTCGGGGTG AATTTTTCAT CGCCGAGGAC GCTACCCGTT TTGATCGCCA AGAAGCCCAA CGCGGCTGGG ATATCGCCAA GGGAGCCATC GCCAGCGCCC TCTATTCAGT CGTTGTGCTT GATGAACTCA ATCCCGTGCT GGATCTAGGC CTGCTGGCTG TTGACGATGT AGTCAAAACA CTCACCGACC GACCTGACGG CATGGAGATC ATCGTCACCG GACGGGCCGC TCCACGAGCC CTGGTTCAAG TGGCCGATCT GCATTCCGAA ATGCGCGCCC ATCGTCGTCC GGAGCCTCAG GATGACAGCG TCATTCCCTT CCTTCCCACC GGTGGGATTG AGATCTACAC CGGTGAGGGC AAAGGCAAAT CCACCAGCGC CCTAGGCAAG GCTCTACAGG CCATTGGCCG TGGCATCAGC CAAGACAAAA GCCATCGCGT GTTGATCTTG CAGTGGCTTA AAGGGGGCAA TGGCTACACC GAAGACGCTG CCATTGCGGC CTTAAGAGAA AGCTATCCCC ACCTTGTTGA TCATCTTCGC TCCGGCCGGG ATGCCATCGT CTGGCGAGGT CAACAACAAC CGATCGATTA CGTCGAAGCA GAGCGCGCAT GGGAAATTGC CAGAGCCGCC ATCGCCAGTG GGCTCTATAA AACTGTCATC CTCGATGAAC TCAATCCCAG CGTGGATCTT GAACTGATTC CTGTGGAGCC GATTGTTCAG ACCCTGCTGC GCAAGCCCGC CGAAACCGAA GTCATCATCA CCGGCCGCTG CAAAAACCAA CCTGCCTACT TCGATCTGGC CGGCGTCCAC TCAGAAATGG TGTGCCACAA GCACTATGCA GAACAGGGCG TAGATCTCAA GCGAGGCGTG GATTACTAA
|
Protein sequence | MASQTAASKT PRNRQTSSGR RGRGIGIATA AESNERSHGQ LHIYDGEGKG KSQAALGVVL RTIGLGICEQ RRTRVLLLRF LKGPGRAYDE DAAIEALQQG FPHLIDQVRT GRGEFFIAED ATRFDRQEAQ RGWDIAKGAI ASALYSVVVL DELNPVLDLG LLAVDDVVKT LTDRPDGMEI IVTGRAAPRA LVQVADLHSE MRAHRRPEPQ DDSVIPFLPT GGIEIYTGEG KGKSTSALGK ALQAIGRGIS QDKSHRVLIL QWLKGGNGYT EDAAIAALRE SYPHLVDHLR SGRDAIVWRG QQQPIDYVEA ERAWEIARAA IASGLYKTVI LDELNPSVDL ELIPVEPIVQ TLLRKPAETE VIITGRCKNQ PAYFDLAGVH SEMVCHKHYA EQGVDLKRGV DY
|
| |