Gene Information |
Locus tag | A9601_04371 |
Symbol | |
ID | 4717133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 379721 |
End bp | 380686 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640078147 |
Product | hypothetical protein |
Protein accession | YP_001008832 |
Protein GI | 123967974 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACAA TCAGAAGAAG CGGCAGCGGC TGGCAGGTCC TTATCAGAAG GAAGAATTAT GTAGGCCCGA GGTCCAGAAA TTTTCTTTCC AGAGATCTGG CTGAATCCTG GGCAGATGCA GTCGAAGAGA GAACAAAAAA GGTTTTAAAT GACACTCCCG TCACCCTGGG AGAGGCAATC AATGACTATA TAAATGGTCC ATTACTTCTG CACCGCAGTG CGGAGAATGA AAAATATCCT CTCAGGGTTA CTGCAGAAAG CTGGCTGGGA GATATTCCAC TAAGTGATCT GCAGATTAAG CATTTTGCAG TCTGGAGAGA TGAACGATTA CTGAAGGTGA AACCGAATAC AGTTATGCGG GAACTGAGGA TATTAAGAGT ATTGATTGAC TGGGCAAGAG ATGAAAGAGG AGCGGAAATA AAAGATAACC CCGCAAGGCA ACTGAGAGTG AGAGGGACAG GAGATGCAAG AGCTCCTTTT TTAACGAATG AAGATGAAAA AAGACTCCTG TTTGAACTGT CGCAGATGTC CAACCAAAAT CATCTGAGAC TCACAAAGCT TGCACTGACA ACTGGCTTCC GCCGCTCCGA ACTTTTGAGC CTGACCTGGA GAAATATAGA TCTGAAGAAA AAATTACTCT ATATATATAG AAAGAATTGT GCCGCAATAG ATAATTCATC CGGAATGAGG CTTGTTCCTT TCCCTGAAAA GGCGCAGAAG ATCCTTGAGG AATTACAGGG AAGAGATGGG AAAGTTATAG AACTTTCAAA AGGTGCTGCA AGAAATGGAT TTGATAAAGC CCGAAAAAAA GCAGGACTTG AAACTCTCAG ATTTCATGAT CTAAGACATA TAGCCATAAG CAGAATGTGG CGTTCGGGAA TGAGTGCCCT GGAGATAAGT GCATGCAGCG GCCACAGAGA TATAAAAATG TTGATGCGCT ACAGCCATTT TCAACTTTCC ATATAA
|
Protein sequence | MATIRRSGSG WQVLIRRKNY VGPRSRNFLS RDLAESWADA VEERTKKVLN DTPVTLGEAI NDYINGPLLL HRSAENEKYP LRVTAESWLG DIPLSDLQIK HFAVWRDERL LKVKPNTVMR ELRILRVLID WARDERGAEI KDNPARQLRV RGTGDARAPF LTNEDEKRLL FELSQMSNQN HLRLTKLALT TGFRRSELLS LTWRNIDLKK KLLYIYRKNC AAIDNSSGMR LVPFPEKAQK ILEELQGRDG KVIELSKGAA RNGFDKARKK AGLETLRFHD LRHIAISRMW RSGMSALEIS ACSGHRDIKM LMRYSHFQLS I
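The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is illustrative only and is not part of the database. The helper names (`gene_length`, `protein_length`, `translate`, `gc_fraction`) are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene sequence is already shown in coding orientation (it begins with ATG even though the gene is on the minus strand), and that translation table 11 assigns the same amino acids to sense codons as the standard code.

```python
# Values copied from the record above.
START_BP = 379721
END_BP = 380686

# Codon table in the conventional TCAG ordering. Assumption: translation
# table 11 uses the same sense-codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 966, matching Gene Length
    print(protein_length(length))  # 321, matching Protein Length
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGGCAACAA TCAGAAGAAG CGGCAGCGGC"))  # MATIRRSGSG
```

Passing the full Gene sequence value to `translate` and `gc_fraction` allows the same comparison against the Protein sequence and GC content fields.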