Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_14221 |
Symbol | |
ID | 4718143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1195211 |
End bp | 1196134 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 26% |
IMG OID | 640079143 |
Product | hypothetical protein |
Protein accession | YP_001009813 |
Protein GI | 123968955 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAG AATTGTTAGT AACTGGATCA TCAGGATTTT TTGGAAGTGC ATTAATAAAT AGAGCCCTAA AAAGGGGATG GTTTGTTAAG GGCACAGCAA GACATTCTTT AGGAATTCTC TCTGAACAAT TTGGTGTTGA TATCAATTAT TTAGATTTAT CAAAAGATAC AATTTCAATA CCAAAAGCTA ATTATATAGT TCATTGCGCA ACTGCTAATG AAATTAAGTC TTTAGATTTA TTTAAATCTA TCGATTCAAC TATAAAAGGC ACAAAAAAAT TAATTGAATA TTGTTTAGAA AATCGATTTG AGCATTTTAT TTATATTTCG ACTGTTGGAA TTTATGGAAG AGAACTTAAT GGAGAAATTA ATGAAAATTC TCCTTTTCAA GCAAATTCTA ATTATGCTTT AAATCATTAT TATGCAGAAA AAATTTGTGA AAGATATGCC TCAAGAAATT TTAAAGTGAC AATAATAAGA TTATCCAATG TTTATGGAAT TCCTTCTGTT AGCACTGTAG ATAGAAATAC ATTGGTACCT ATATGCTTTG TAGTTAATTT ATTAAGAAAA GGTGTTGTAG AATTAAATTC TTCTGGACTT CAGCAAAGGG ATTTTATTAA TCAAATTGAA GCATCAGATA TAGTATTAAA TTCCTTAAAT AATCAGAAAA GTAATTTCGA TATAATTAAT GCTTCAAGCG GAAAAAGTTA TTCAATTATC GAAATTGCAA AAATTGCATG TCAAGAATAT TCTAAATTTT CAGGAAAAGT TGGAAAAATA ACTTCAATGC CTGATAAAAA TAATTATGAA AATAACTATA GTTTTTCTAG TAAGGCTTAT AAAAGTAAAG ACAAAAATTT AGAATATCTT TCAATAAATG AAACTATTTC AGAGTTATTT AAAATTTATA ATGCATTAAT TTAA
|
Protein sequence | MQKELLVTGS SGFFGSALIN RALKRGWFVK GTARHSLGIL SEQFGVDINY LDLSKDTISI PKANYIVHCA TANEIKSLDL FKSIDSTIKG TKKLIEYCLE NRFEHFIYIS TVGIYGRELN GEINENSPFQ ANSNYALNHY YAEKICERYA SRNFKVTIIR LSNVYGIPSV STVDRNTLVP ICFVVNLLRK GVVELNSSGL QQRDFINQIE ASDIVLNSLN NQKSNFDIIN ASSGKSYSII EIAKIACQEY SKFSGKVGKI TSMPDKNNYE NNYSFSSKAY KSKDKNLEYL SINETISELF KIYNALI
|
| |