Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21261 |
Symbol | |
ID | 4777111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1887535 |
End bp | 1889247 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640087634 |
Product | NHL repeat-containing protein |
Protein accession | YP_001018126 |
Protein GI | 124023819 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.404692 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCA TTTCCTTTCC CATCATTGGA GGTGCATTGC TGGGCTCGCT GTCGGTCTTG CTTGCCTCCT GCGATGTCTC CAGCATCACG GGAAATGCGT CAAGCTTCTC AGGGAGGGTT TCCGTATTAG GAAAGCCCGT CAGTGAAGCC AAGGTTTTGC TCTGGCAGTC TTCCGCCAAG GATGGAGCGA AGAAAATCGT TGAGACCAAA ACCTCTAAAG ATGGGGCGTT CAAATTGTCC ATACCCCGCA ATAAGGATAA GGTCACTTAT TTAATTTCCG AAGGAGGAAG TATCGATGGG CAAGATGCTG GCAACTTGAA AATGCTCAGT GTTCTTGACT CCAATTCCAA CAAGTCTGTC GTCATCAACG AACTGACATC CGTGGGATCA GTGTGGCCCA ATGCTCAACT GATCAACGAA AATAAAATCA GTGGTAGCAA AACAGCTTTA GCCATTGGTT CGGGACATGT AAGCCACCTA GTCAATACAA GCACTGGCAA ATATGGCGAA ACACTTCTGA ATGGCTTAAA CCTACCCAAC TCCGAGACGA TGGTTCGTCT CAACGTTCTT TCCAACCTAT TGGCCCTTTG TGGCAATAGC AATACAGCCG ATGGCTGTAG TCAGCTTCTC AATCTAACCA ATTCAGATAA CAGCCTCAAC GCACTGATTT CTATCGCCAA ACAGCCGTGG GCAAAAGCCA GTGAGCTCTA CAAACTGTTC ACTCAACAAT ATCCACTCAA CAAGGCAGAA CAAATTCGAG AAGGCGCAAC CCTTCCCTAC CTGTTATTTG AACCGGAGAG TTTTTCACTC TCGATCCGTC TAGAGGGTGG CGGCGCAATG GCACTCGGAA AAATGATGTT TGACGACAAC GCCAATATGT GGAGTGGTGC CAACTGGATG CCTGGTTCCC AATCTGGTGT CATCAACTCC ATTGGAGGAG GGCTCACCAG GTTCAACGCC AGCGGAGAAG CACTCTCCCC ATCACCTCAG GGATATAACG GACAAGGGCT GAATGGCGTG GGCTGGGGAA CAGGAGTCTC GAAAAATTAT GCCTGGGTAG GCACTTTTAA CAACAAGATC GGGGTCTTTG ATCTTGAAGA TGGCAAACCA CTTGGCCCCG CGACTGTTGA TGGCGAGATC GGCCAACTGC AAGCAGTAAC AACCGCCGCG AATGGAGATG TATGGATTGC AGATAACACC AAGGATCACA TGATTCGGTT CCCCGATGGT GACTATAAGA ATGGCGAGCG ATTAACAATC AAAGGATTAC AAGCTCCTTT CGGTATTGCT GTTGACCAGC AAAATAGAGT CTGGGTAACC AGCAGCTACA ACAATAAACT CACCATCTTC CCTGGAGAAA ACCCTGAGGC AGTGAAAACA ATTGATATCG CTTTAGGAGC AAGAGGTGTG GCAATCGATT CCAAAGGCAA TGCTTGGGTC GCCCAGCAGA CTGATTCAGC AACGCTAGTT TTACCTCCTG GGGTCAGCAA GGCACCTCCA CGCCCAACAA CAATCATGCA GGAATTCATG CAAGGGCTTG AATATGCAAA AGCAAATCCG CAGCAAACAA GCTCTGGAAT GATTGCACTG ATCTCACCTG ATTTGAAGAT GATTAAATCA GACATTGCAA AAGGAGATGT CTATATCCCC TGGGGCGTAA GCATCGATGG CAATGACAAT GTATGGGTTG CAAATCTTTT TGGTAGCAGT TGA
|
Protein sequence | MKPISFPIIG GALLGSLSVL LASCDVSSIT GNASSFSGRV SVLGKPVSEA KVLLWQSSAK DGAKKIVETK TSKDGAFKLS IPRNKDKVTY LISEGGSIDG QDAGNLKMLS VLDSNSNKSV VINELTSVGS VWPNAQLINE NKISGSKTAL AIGSGHVSHL VNTSTGKYGE TLLNGLNLPN SETMVRLNVL SNLLALCGNS NTADGCSQLL NLTNSDNSLN ALISIAKQPW AKASELYKLF TQQYPLNKAE QIREGATLPY LLFEPESFSL SIRLEGGGAM ALGKMMFDDN ANMWSGANWM PGSQSGVINS IGGGLTRFNA SGEALSPSPQ GYNGQGLNGV GWGTGVSKNY AWVGTFNNKI GVFDLEDGKP LGPATVDGEI GQLQAVTTAA NGDVWIADNT KDHMIRFPDG DYKNGERLTI KGLQAPFGIA VDQQNRVWVT SSYNNKLTIF PGENPEAVKT IDIALGARGV AIDSKGNAWV AQQTDSATLV LPPGVSKAPP RPTTIMQEFM QGLEYAKANP QQTSSGMIAL ISPDLKMIKS DIAKGDVYIP WGVSIDGNDN VWVANLFGSS
|
| |