Gene Information |
Locus tag | P9211_15621 |
Symbol | |
ID | 5730302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1394015 |
End bp | 1395154 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285940 |
Product | hypothetical protein |
Protein accession | YP_001551447 |
Protein GI | 159904103 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
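The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the Gene Length includes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1394015
END_BP = 1395154
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # assumption: the stop codon is counted in the gene length


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the record.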
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTCAAGA AAGCGAGTTC TCCGCTCTTG GAAAATAATC TGGTTACTCA CAAGAAAGTC TTCATTATTA GTTATCTAGG ATGGCAGGAC ATCGAACTTA GTGGAAAAAT AAAGTTCATA AGACTCTTCA TCATCACTGA AGATCCCCAA GGTATGAGTC CTGACCTCCT ATTCCTAGCA ATTCTTGTTG TAGTTGTAAT AATTGGCTCT GCGTTGTGCT CTGGAGTAGA AGCTGCATTC CTAGCAGTTA ATCCATTACG AGTACATGAA CTTGCCGCTA AGAAAAGACC AGTGAATGGA GCGAGAAGGT TAGAAAAACT CAGACATCGC CTAGGAAGGA CGCTAACAGT ACTAACAATT ACAAATAATG GCTTCAACAT CTTTGGGAGT TTAATGCTCG GCAGTTATGC AACGTTTGTT TTTAAAAGTG GGATGGTACT ACCTTTGTTC TCAATAGGTT TAACCATCTT GGTATTGCTA TTAGGAGAAA TAGTCCCAAA ATCAATTGGG ACGAGGTTTT CCTTGAAAGT TTCACTTGCA AGTGCGCCTA TTCTTACTTT ACTGAGCCTC ATAATGAGAC CTGTAATCAT TCCACTAGAG CGATTGCTGC CTGTAATTAC TACAGAAAAT GAAATAAGCA CGGATGAAGA GGAAATTACG CAAATGGCCA GACTTGGTTC GCAAAAAGGG TATATAGAAG CCGATGAAGC CGCAATGATT TCTAAAGTCT TTCAATTAAA TGATTTAACA GCAAGAGACC TTATGACTCC AAGAGTGGCA GCACCAACCC TTGATGGCTC CATAAGCTTA GAAGAGGTGA GGCCAAACTT AATCACTAAT CACTCTCAAT GGTGGGTAGT TCTAGGCAAA GAAGTAGATA AAGTAATAGG AGTAGTTAAT CGAGAACAGC TTCTTACTGC ATTACTACAA GGACAAAATC AACTTACTCC AAAAGATCTC GCAGAAAAAG TAGAGTTTGT CCCAGAAATG ATTCGAGTAG ATAGATTGCT GAATAATTTT ACTGAAGATA AGAATGGCGT CAGAGTTGTC GTAGATGAGT TCGGAGGTTT TGTTGGTTTA ATTGGAGCAG AAGCAGTCTT AGCTGTTTTG GCAGGTTGGT GGAGGAAATC CAAGATATGA
|
Protein sequence | MVKKASSPLL ENNLVTHKKV FIISYLGWQD IELSGKIKFI RLFIITEDPQ GMSPDLLFLA ILVVVVIIGS ALCSGVEAAF LAVNPLRVHE LAAKKRPVNG ARRLEKLRHR LGRTLTVLTI TNNGFNIFGS LMLGSYATFV FKSGMVLPLF SIGLTILVLL LGEIVPKSIG TRFSLKVSLA SAPILTLLSL IMRPVIIPLE RLLPVITTEN EISTDEEEIT QMARLGSQKG YIEADEAAMI SKVFQLNDLT ARDLMTPRVA APTLDGSISL EEVRPNLITN HSQWWVVLGK EVDKVIGVVN REQLLTALLQ GQNQLTPKDL AEKVEFVPEM IRVDRLLNNF TEDKNGVRVV VDEFGGFVGL IGAEAVLAVL AGWWRKSKI
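The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The sketch below is an illustration only, not part of the original record. It inspects just the first and last 10-base blocks copied from the Gene sequence field, and the variable names are hypothetical.

```python
# Initiation codons listed for NCBI translation table 11 (bacterial code).
TABLE_11_START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}
# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last 10-base blocks copied from the Gene sequence field above.
FIRST_BLOCK = "TTGGTCAAGA"
LAST_BLOCK = "CAAGATATGA"

start_codon = FIRST_BLOCK[:3]
stop_codon = LAST_BLOCK[-3:]

# A CDS under table 11 should open with a permitted start codon
# and close with a stop codon.
is_valid_start = start_codon in TABLE_11_START_CODONS
is_valid_stop = stop_codon in STOP_CODONS

print(start_codon, is_valid_start)
print(stop_codon, is_valid_stop)
```

Taking the last three bases of the final block as the stop codon relies on the sequence length (1140 bp) being a multiple of 3, as stated in the Gene Length field.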