Gene Information |
Locus tag | A9601_14191 |
Symbol | |
ID | 4718140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1191765 |
End bp | 1192823 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640079140 |
Product | hypothetical protein |
Protein accession | YP_001009810 |
Protein GI | 123968952 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
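The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, although both are consistent with the reported values. The function names are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 1191765
end_bp = 1192823
reported_gene_length = 1059    # bp
reported_protein_length = 352  # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the table: 1192823 - 1191765 + 1 = 1059 bp, and 1059 / 3 - 1 = 352 aa.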
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATT ATTTAATAAC AGGAAACTTG GGTTATATAG GGCCTGTTTT ATGTAAATTT ATAAAAAAAA ATGATCCTAA TTCATTCATA ACCGGATATG ACAATGGTTA CTTCGTTAAT TGCCCCATAG CCGCAGAGGT ATTGACGGAA AAACCCTTCT TAGATTTACA AGTATATGGT GACGTTAGAG ATAAAAATAA ATTATCCAAA TATATTGAGA AAGCAGATTT TATTGTTCAC TTAGCAGCTA TTTCAAATGA TCCAATGGGT AGCGAATTTG CGAAAGTGAC AAAAGAGATT AATCAAGAAT CAAGTTTATT CATAGCCAAA GAGGCTATAA ATAATAATGT AAAATCATTT GTTTTTGCAT CATCATGTTC AGTATATGGA ACCGGATCAG ATTCACCAAG AACTGAAAAA GATCCTGTTA AACCGCTAAC TGCTTATGCA AAAAGTAAGG TAGGCACTGA GAATGATTTA TTAGGTTTAA TTAATTATAA AAACACAAAA ATCACCTCAC TTAGATTTTC TACCGCCTGC GGGTATAGTC CAAATTTAAG GCTTGATCTT GTTCTTAATG ATTTTGTCGC GACAGCAATT AATTCAGGCA AAATTGAAAT ATTAAGTGAC GGGTCTCCAT GGAGGCCACT TATAGATGTA GAAGATATGG CAAGATCTAT TTTCTGGGCA TGCAATAGAT TATCTGGGAA ACAAATGGAG GTTATAAATG TGGGATCACA AGATTGGAAC TACCAAATAA AGGATTTAGC TTTTGAAATT AAAAAATTAT TAGGAGAAGA TATTCATATA AAAATCAATA AGTCTGCTGC ACCTGATAAA AGATCTTATA AAGTTTGTTT TGATAAATAC TTTAATTTAA CCCCTCAAAA TTTTACACCT CAAATAAATT TGAAAAAATC TGTAATAAGA ATGTTAAAGG CTCTCAAACC ATTTAAAGAA AGACTTTCAA GAGAAGATAG AAATCATTTA ATTAGATTAA ATGCTCTTAA ATCTCTCAAA CAAAACAACT TAATAGATGT AAATTTAAAT TGGAAATAA
|
Protein sequence | MNNYLITGNL GYIGPVLCKF IKKNDPNSFI TGYDNGYFVN CPIAAEVLTE KPFLDLQVYG DVRDKNKLSK YIEKADFIVH LAAISNDPMG SEFAKVTKEI NQESSLFIAK EAINNNVKSF VFASSCSVYG TGSDSPRTEK DPVKPLTAYA KSKVGTENDL LGLINYKNTK ITSLRFSTAC GYSPNLRLDL VLNDFVATAI NSGKIEILSD GSPWRPLIDV EDMARSIFWA CNRLSGKQME VINVGSQDWN YQIKDLAFEI KKLLGEDIHI KINKSAAPDK RSYKVCFDKY FNLTPQNFTP QINLKKSVIR MLKALKPFKE RLSREDRNHL IRLNALKSLK QNNLIDVNLN WK
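The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the spacing, compute GC content, and translate a coding sequence. It is an illustration, not the method the database used: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons). Only short excerpts of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final block of the gene sequence above.
head = clean_sequence("ATGAATAATT ATTTAATAAC AGGAAACTTG")
tail = clean_sequence("TGGAAATAA")

print(translate(head))  # MNNYLITGNL
print(translate(tail))  # WK
```

The two excerpts translate to the first ten and last two residues of the protein sequence listed above. Applying `gc_percent` to the full cleaned gene sequence would give the value to compare against the reported GC content.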