Gene Information |
Locus tag | A9601_06831 |
Symbol | |
ID | 4717386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 606573 |
End bp | 607631 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640078396 |
Product | light-harvesting complex protein |
Protein accession | YP_001009076 |
Protein GI | 123968218 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACCT ATGGAAATCC AGATACTACC TATGGATGGT GGGCTGGTAA TTCAGGTGTA GCAAATCGCT CAGGAAAATT CATTGCTGCT CATGTAGCTC ATGCAGGATT AATTGTTTTC TGGGCGGGTG CATTCACCCT TTTTGAACTT TCACGATTTG ACCCAAGCGT CCCAATGGGT CATCAACCTC TAATCGTTCT TCCTCACTTA GCAACTCTTG GAATAGGGTT TGATGCTAAT GGTGTTGCGA TGGGAGATAC TAAACCTGTT CTAGCGATAG CAATAGTTCA CTTAGTTTCT TCTATGGTTT TAGCAGCCGG AGGACTTTTA CACTCTTTAC TTCTTCCTGG AAATCTAGAA GATTCTGATG TAGCAAGAGC TAGAAAATTC AATATTGAAT GGGATAATCC AGACAAATTG ACATTTATTC TTGGTCACCA TCTAATTATT CTTGGTTTCG CAGTTATTGC TTTTGTCGAA TGGGCAAGAG TGCATGGAAT TTATGATCCA GCTATTGGTT CTGTAAGACA GGTTGAGTAT GAATTAAATT TGGCCAAAAT TTGGAATCAC CAAACAGACT TTTTGACTAT TGATAGCCTT GAAGAAGTAA TGGGAGGTCA TGCTTTCCTC GCTTTCGTTG AGATCACTGG TGGTGCTTGG CATATTGCTA CTAAGCAAGT TGGTGAATAT ACCAAATTCA AAGGTAAAGG ACTTCTCTCT GCAGAAGCTG TTCTCTCATG GTCATTAGCT GGAATAGGTT GGATGGCTAT TATTGCAGCT TTCTGGAGTG CAGCTAACAC AACAGTTTAT CCAACTGAAT TCTTTGGTGA ACCACTTGAA TTGAAGTTTA GTATTTCGCC TTATTGGGTA GATACAGTTG ATCTTCCTGA TGGTGAGTAC ACTTCAAGGG CATGGTTAGC TAATGTTCAT TACTATTTTG GATTCTTCTT TATTCAAGGT CATCTATGGC ACGCTTTAAG AGCACTAGGC TTTGATTTCA AGAGAGTTAC AAATGCTATC AGTAATATTG ATAGTGCAAC AGTTACTCTT AAAGATTAA
Protein sequence | MQTYGNPDTT YGWWAGNSGV ANRSGKFIAA HVAHAGLIVF WAGAFTLFEL SRFDPSVPMG HQPLIVLPHL ATLGIGFDAN GVAMGDTKPV LAIAIVHLVS SMVLAAGGLL HSLLLPGNLE DSDVARARKF NIEWDNPDKL TFILGHHLII LGFAVIAFVE WARVHGIYDP AIGSVRQVEY ELNLAKIWNH QTDFLTIDSL EEVMGGHAFL AFVEITGGAW HIATKQVGEY TKFKGKGLLS AEAVLSWSLA GIGWMAIIAA FWSAANTTVY PTEFFGEPLE LKFSISPYWV DTVDLPDGEY TSRAWLANVH YYFGFFFIQG HLWHALRALG FDFKRVTNAI SNIDSATVTL KD
| |
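
The coordinate and length fields in the record above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene span includes the terminal stop codon (the listed gene sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 606573
END_BP = 607631


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases, ignoring the spaces between 10-base blocks."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(b in "GC" for b in bases) / len(bases)


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1059 352"
```

Under these assumptions the computed values agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields. Passing the full Gene sequence string to `gc_fraction` gives a value that can be compared against the reported GC content.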