Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_04781 |
Symbol | |
ID | 4717176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 414083 |
End bp | 415573 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640078190 |
Product | hypothetical protein |
Protein accession | YP_001008873 |
Protein GI | 123968015 |
COG category | [R] General function prediction only |
COG ID | [COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.289419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAG TATCAATTAT TTTCCCGAAT CAACTTTTTA GAGAAAGCCC AATCTTAAAA ATAAATTGTG AAGTTTTGAT TTTGGAAGAC TCATTATTTT TTGGAAATGA TAAATTTCAT AAATTAATTA ACCATAAAAA TAAGTTAGTT TTTCATAGAG CATCTATGCT CGCTTATAAA AATTATTTAG AAATATCTGG CTTTAAAGTT TTGTATATCG AAAACAAGAA TAATGTTTCT ACAGTTGATT ACTTATCGGA ATTTATTAAA AATAAATATC AGAAAATAAA TCTCATTGAC CCTCATGATT TTTTAATAAT GAAGAGGATT AATAATTTTG TTGAAAGTAA TAATTTAACT TTAAATATTT TGCCTTCTCC TATGTTTATG AGCAGTGAAG ATTTAAAAAA TTTATTTGTA TCAAATGCAA AAAAACCTCT TATGGGGAGA TTTTATGAGA ATCAAAGAAA GAGCCAAAAG ATATTAGTTA ATTCTGATGA TACACCTGAA GGTGGTAAAT GGAGTTTTGA TGAAATGAAC AGAAAAAAAT TACCAAAAAA AATAAATATA CCCGACACAC CAAAATTACA AAAAAATAAA TTTGTAGTTA ATGCAGAAAG GTCATTAGCT AATTTTGATA TTGAGTTTAT TGGAGAAAGT AATAACTTTT TATATCCAAC TAATTTTGAA GAGGCAGATG AATGGTTAAA TGATTTTTTT AAACATAAAT TTTTCTTATT TGGAGATTAT GAGGATGCTA TTTCTAAAGA AAATTCTTTT TTATGGCACA GTTTACTTTC TCCTCTTTTA AATAGCGGCT TATTAACCCC AGATGTAGTA GTAAATAAAG CATTACTTTT TGCAAAAAAT AATAATGTTC CTATTAACTC TTTAGAGGGT TTTATTCGTC AAATTATTGG ATGGAGAGAA TTTATTTGCC TCGTCTATAA AAAGTACGGA ACAAAGATGC GAAATAGTAA TTTTTGGAAT TTTGAAGATA AGCCAATTCC AAAATCTTTT TATCAAGGAA ATACAGGAAT TGAACCAGTA GACGTTGTTA TAAAAAATAT TATTAAATTT GGTTATTGTC ATCATATTGA GCGGCTAATG ATTGTTGGCA ACTTTATGCT TCTATGTAGA ATTCACCCCA ACCAAGTTTA TAAATGGTTT ATGGAAATGT TTATTGATTC CTATGATTGG GTTATGGTCC CAAATGTTTA CGGAATGAGT CAGTTTAGTG ATGGTGGTAT CTTTTCAACA AAGCCATATA TATCAAGCTC TAATTATGTA AAAAAAATGT CTAATTTTAA AAGTGGTCTA TGGTGCGAAA TATGGGATGG CTTATTTTGG AAATTTATTA AAGATAATGA AAGCTTTTTT AGAAAGCAAT ATCGTCTGGC AATGTTAACG AGAAATCTCG ATAAAATGTC AGCGGAAAAA TTAAATAATC ACTTAAAAAA GGCCGATAAA TTTTTAAGAG ATATTCAATA A
|
Protein sequence | MKQVSIIFPN QLFRESPILK INCEVLILED SLFFGNDKFH KLINHKNKLV FHRASMLAYK NYLEISGFKV LYIENKNNVS TVDYLSEFIK NKYQKINLID PHDFLIMKRI NNFVESNNLT LNILPSPMFM SSEDLKNLFV SNAKKPLMGR FYENQRKSQK ILVNSDDTPE GGKWSFDEMN RKKLPKKINI PDTPKLQKNK FVVNAERSLA NFDIEFIGES NNFLYPTNFE EADEWLNDFF KHKFFLFGDY EDAISKENSF LWHSLLSPLL NSGLLTPDVV VNKALLFAKN NNVPINSLEG FIRQIIGWRE FICLVYKKYG TKMRNSNFWN FEDKPIPKSF YQGNTGIEPV DVVIKNIIKF GYCHHIERLM IVGNFMLLCR IHPNQVYKWF MEMFIDSYDW VMVPNVYGMS QFSDGGIFST KPYISSSNYV KKMSNFKSGL WCEIWDGLFW KFIKDNESFF RKQYRLAMLT RNLDKMSAEK LNNHLKKADK FLRDIQ
|
| |