Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_07021 |
Symbol | |
ID | 4717405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 625898 |
End bp | 627100 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640078415 |
Product | hypothetical protein |
Protein accession | YP_001009095 |
Protein GI | 123968237 |
COG category | [S] Function unknown |
COG ID | [COG2138] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAACTTC AAGCAAACTA TTATTTTTCA CTACATGATA TTGTTCTTTA CTATTATGGT ATAAGTAATA ACTTGATTTT GGATAATTTA GATTCGAAGT TAAATAATCA AGTCGCGATA CTTATCTGTG GACACGGCAG TAGAAATAAA CTAGCCATTA CTGAATTTCA AGAATTAACT CAGTTTATCC AAAAAAGATA TCCAAACTAT TTGGTTGAAT ATGGTTTCTT GGAATTCGCT AAACCTTCAC TTGTTGATGC TCTAGACAAA TTAAGAGATC TTTCTATAAA AAAAGTAATT GCAATACCCG CAATGCTTTT CGCTGCTGGC CATGTGAAAA ATGATATACC TAGCTTGCTT ATGAATTATT CAAGTAAAAC AGGTATTGAA ATAATTTATG GAAGAGAATT AGGTATTAAT AATTTAATGA TTAGTGCAGC TTGTGAAAGA GTTAAAGATG TATTTAAACA AAATAATACA CTTAAACCTG AAGAATCATT ATTAGTTGTT GTTGGTAGAG GCTCTTCTGA CCCAGATGCG AATTCCAATG TTTCAAAAAT TACGAGAATG ATCGTAGAAG GTATTGGTTT AGGGTGGGGG GAAACAGTTT TTTCTGGGGT AACTTTCCCT CTAGTTGAAC CTGGCTTGAA AAATGTTGCG AGACTTGGTT ATAAAAATAT AATTATTTTC CCTTATTTCC TTTTCTCAGG TGTCCTTGTC ACAAGAATAA AAAGGCAAAG TGATTTAGTT GCTATTAATA ATCCAAATAT TTCATTTATA CATGCAAAAT ATCTTTCGTC ACAGTCTTAT GTGGTCGACA CTTTTGTAGA AAGGATTGAA GAGATTCTTA ATAACGAAGG TAATAATTTT ATGAATTGCT CAACCTGTAA ATATAGGTCA AATTTATTTG GCTTTGAAAA AGAAGTTGGA ATGGTACAAG AAAGTCATCA TGACCATGTA GAGGGCTTGG GTATCAGTTG TGATTTATGT GATCCTGAAT GTAATGGTGC TTGTGAAATA CAAAATCAAA TACCAACTCA TAACCAAGAA AAATCAAACT CAGGAGGAGG AGATTACTTG GAACATGAAC ATGTGGAGGC TCATCAACAT GAACATGATC ACCATCACCA TCACCATAGT ATTTATCCAA ATTCAAAACA CCCTTTAGGA CCTGTCACGC TTCGCTTGCC TAATAAAGAC TAA
|
Protein sequence | MELQANYYFS LHDIVLYYYG ISNNLILDNL DSKLNNQVAI LICGHGSRNK LAITEFQELT QFIQKRYPNY LVEYGFLEFA KPSLVDALDK LRDLSIKKVI AIPAMLFAAG HVKNDIPSLL MNYSSKTGIE IIYGRELGIN NLMISAACER VKDVFKQNNT LKPEESLLVV VGRGSSDPDA NSNVSKITRM IVEGIGLGWG ETVFSGVTFP LVEPGLKNVA RLGYKNIIIF PYFLFSGVLV TRIKRQSDLV AINNPNISFI HAKYLSSQSY VVDTFVERIE EILNNEGNNF MNCSTCKYRS NLFGFEKEVG MVQESHHDHV EGLGISCDLC DPECNGACEI QNQIPTHNQE KSNSGGGDYL EHEHVEAHQH EHDHHHHHHS IYPNSKHPLG PVTLRLPNKD
|
| |