Gene Information |
Locus tag | P9211_11631 |
Symbol | |
ID | 5731884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1060116 |
End bp | 1061174 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641285531 |
Product | hypothetical protein |
Protein accession | YP_001551048 |
Protein GI | 159903704 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0312993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCT ACGGAAACCC AGATGTCACC TATGGGTGGT GGGTTGGTAA TTCTGTAGTC ACCAACAGGG CAGGTCGATT TATCGGCTCT CATGTTGGAC ATACAGGAAT TATTTGCTTC GCAACTGGTG CAAGTTGTCT TTGGGAGCTT TCCCGTTTTG ATTCTTCAAT CCCTATGGGG CATCAAAGCT CTATATATCT TTCCCATTTA GCTTCTCTAG GCATTGGCTT TGACGAAGCT GGTGTATGGA CAGGAGCAGG AGTTGCCACA ATCGCAATTT TCCATCTGAT CTTCTCCTTG GTCTATGGCG GTGCTGGGCT TCTTCACTCA TTACTTTTCG ATCCAGATCT ACAAAGTGGA CCGATTGGTC GAGTTGATAA GTTTAAGCTT GAATGGGACA GCCCTGACAA TCTCACTTTC ATTCTTGGTC ATCATTTGAT CTTCTTAGGT GTTGCAAATA TCTGGTTCGT TGAATGGGCC AGAGTTCATG GCATCTATGA CCCCGCCGTT GAAGCTGTTC GCACAGTTTT CCCTGGTTAT GGGGATTTCG GAATGGTTTG GGGCCACCAA TTTGACTTCC TTAAGATAGA CAGTCTTGAA GACGTAATGA GTGGTCACGC ATTCTTAGCT TTCTTACAGA TCAGTGGTGG TGCTTTCCAT ATAGCAACTC GGCAAATTGG TGAATACACC AAATTTAAAG GCCAGGGTTT GCTTTCAGCA GAAGCAGTAC TTTCTTGGTC ATTAGCTGGC CTTTTCTTGA TGGGCTTAGT TGCAGCGTTT TGGGCTGCTG GAAACACAAC TGTCTATCCC ACTGAATGGT ATGGAGAACC TTTAGAGCTT AAGTTCGGCA TTTCTCCTTA TTGGGTAGAT ACAGGAGATG TTTCTGACTG CAAATACTTT TTTGGACACA CAACTAGGGC AGCTCTAGTC AATGTTCAAT ATTATTTTGC TTTCTTCTGC TTACAAGGCC ATTTATGGCA TGCTCTAAGA GCATTGGGCT TTGATTTCAG AAGGATTGCT CAGGCAATAG GTGGTTTGAC AGAGTCAACT GCTTCTTAG
|
Protein sequence | MQTYGNPDVT YGWWVGNSVV TNRAGRFIGS HVGHTGIICF ATGASCLWEL SRFDSSIPMG HQSSIYLSHL ASLGIGFDEA GVWTGAGVAT IAIFHLIFSL VYGGAGLLHS LLFDPDLQSG PIGRVDKFKL EWDSPDNLTF ILGHHLIFLG VANIWFVEWA RVHGIYDPAV EAVRTVFPGY GDFGMVWGHQ FDFLKIDSLE DVMSGHAFLA FLQISGGAFH IATRQIGEYT KFKGQGLLSA EAVLSWSLAG LFLMGLVAAF WAAGNTTVYP TEWYGEPLEL KFGISPYWVD TGDVSDCKYF FGHTTRAALV NVQYYFAFFC LQGHLWHALR ALGFDFRRIA QAIGGLTEST AS
|
| |
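The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length (the gene sequence above ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the stop codon does not encode an amino acid.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1060116, 1061174)
print(length_bp)                  # 1059, matching "Gene Length"
print(protein_length(length_bp))  # 352, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields reported in the record.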