Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08471 |
Symbol | |
ID | 4780616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 777552 |
End bp | 778610 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640084122 |
Product | chlorophyll a/b binding light harvesting protein PcbD |
Protein accession | YP_001014670 |
Protein GI | 124025554 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.679252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00174435 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCAGACCT ACGGGAATCC TAGCGTTACC TATGACTGGT ACGCGGGTAA TTCAGGGACG GCCAATCGCT CCGGAAAATT CATCGCTGCG CATGCTGCCC ATGCTGGTTT GATGATGTTC TGGGCAGGTG CGTTCACTTT ATTTGAGCTA GCTCGTTATG ACTCATCCGT TGCGATGGGT AATCAAAACT TGATCTGCTT GCCTCACCTT GCACAACTTG GAATAGGTGG AATCGAAAAC GGAGTAATAA CTGAGCCTTA TGGTTGCACA GTTATTGCTG TACTTCACCT AATTTTCTCT GGTGTTCTTG GTGCTGGTGG AATTCTTCAC TCAACAAGAT ATGACGGTGA TTTAGGAAAC TATCCTGAAG GAAGTCGTCC TAAAAAGTTT GACTTCGAGT GGGACGATCC AGACAAACTT ACATTTATTC TTGGTCACCA CCTTATTTTC CTAGGTTTGG CAAACATTCA ATTCGTTGAA TGGGCTCAAT ATCATGGTAT TTGGGATACT GCTTTAGGAG CTACTCGTAC AGTTTCTTAC AACCTAGATT TAGGAATGAT ATGGAATCAC CAAGCTGATT TCCTTCAAAT CACCAGTTTG GAAGATGTTA TGGGCGGTCA TGCTTTCTTA GCATTTTTCC AAATTATTGG TGGTGCATTC CACATCATCA CTAAGCAATT TGGTGAGTAT ACAGAATTCA AAGGTAAAGG ACTTCTTTCC GCTGAAGCTG TTCTTTCATA CTCATTAGCT GGTGTTGGCT ATTGTGCACT TGTTGCAGCT TTCTGGAGTT CAACAAACAC AACTGTTTAC TCGACAGAAT TCTTCGGAGA CGTACTTCAA CTTAAGTTTG ATTTCGCTCC TTATTTTGTT GATACGGACT CATCACTTGC GACTGGCGCT CATACAGCTA GAGCTTGGTT AGCTAATGTT CACTTCTATC TTGGCTTCTT CTTCATCCAG GGGCATCTCT GGCATGCACT AAGAGCTATG GGATTTGACT TCAGACGCGT AGGTAAAGCG TTCGACAATA TGGAAAACGC AAAAATCACT AACGGTTAA
|
Protein sequence | MQTYGNPSVT YDWYAGNSGT ANRSGKFIAA HAAHAGLMMF WAGAFTLFEL ARYDSSVAMG NQNLICLPHL AQLGIGGIEN GVITEPYGCT VIAVLHLIFS GVLGAGGILH STRYDGDLGN YPEGSRPKKF DFEWDDPDKL TFILGHHLIF LGLANIQFVE WAQYHGIWDT ALGATRTVSY NLDLGMIWNH QADFLQITSL EDVMGGHAFL AFFQIIGGAF HIITKQFGEY TEFKGKGLLS AEAVLSYSLA GVGYCALVAA FWSSTNTTVY STEFFGDVLQ LKFDFAPYFV DTDSSLATGA HTARAWLANV HFYLGFFFIQ GHLWHALRAM GFDFRRVGKA FDNMENAKIT NG
|
| |