Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01081 |
Symbol | |
ID | 4776122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 107923 |
End bp | 109293 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640085607 |
Product | hypothetical protein |
Protein accession | YP_001016128 |
Protein GI | 124021821 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03573] N-acetyl sugar amidotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.244523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGCT ACCCCTTCCC AAAAGAAACG GATAAAGCTC TCTACAGAGA TGCAGTAAGT AATGATGCGT ACTATGGATT GCCACAACCA GTTAGCTTCT GCAAATTATG TGTTATTAGT AATCAACGTC CCTCAAGCAC GATTGAATTT AAAAATAACG GTACAAAGCC TAAAACAGTA ATACAATTTT CAAAGGACGA AATATGTGAT GCCTGTCGAG CTAAGAAGCA AAAATCGGCA ATTGATTGGG ATGAAAGGGC AAAGGAGCTG AAAAATCTAT GTGATCGCTT TAGAAAAACA GATGGAGGAT ACGATTGCTT AATTCCAGGG AGTGGGGGAA AGGACAGTTT TATGCAAGCA CATATTTTAA AGTATGAATA TGGGATGAAT CCATTAACAT GTACTTGGGC ACCAAATATT TATACTGATT GGGGCTGGAA AAATCATCAG GCATGGATTC ATGCAGGATT TGACAATATA CTTTTTACAC CGAATGGAAA AGTTCATCGG CTGATCACAA GATTGGCAGT AGAAAATCTC TTTCACCCTT TTCAACCTTT TATTCTAGGG CAAAAAAATC TCGGACCAAA GATTGCAGAT TTATATAATA TTAATCTCGT CTTTTATGGT GAAAACGAGG CTGAGTATGG GAATCCTATG GCTGATATGT CCTCTGCACT AAGAAACTGG GAGTATTTTA CCGCTTCTAA TGAAGATGAA ATTTATTTAG GAGGAGCATC ACTCAGTGAA CTCAGAGAAT TAGGATTGAA GGATAGTGAT TGGGAGATAT ACCTACCAAT CGACCCTAAG ATTATCAGTA AAAAGCAAAT AGAAGTGCAC TATCTTGGTT ATTATAAAAA ATGGCATCCC CAGGCTGCTT ATTATTACGC GATTGCGCAT GGCAATTTTC AGAGTTCGCC AGAAAGAACA ATAGGCACAT ACAGTACATA CAATTCAATT GATGATAAGA TCGATGATTT CCATTATCAT ACAACTTTTA TCAAATTTGG AATAGGGCGT GCTACATATG ATGCTTCACA AGAAATACGT TCAGGGGATC TAGTAAGAGA AGAAGGGGTT GCATTGGTGA AAAAGTTTGA TGGAGAATAT CCTGAAAGGT GGGCGGATGA AATATTTAAA TACCTAAGTC TTCCTATGAA TGAATTCCCA ATAGCATCAA AGATGTTTGA AGAGCCAATC TTTAACAAGA CATACTATGA GCGACTGTGT GATAAATTCA GATCACCTCA TTTATGGACA TGGGATCGTG ACATCGGATG GAAGTTACGC CATCAGGTAT CGAACAATAA TATCGATCAA AAAGAAACTG ATCTTCTAGC ATGGGAAGGC AATCAAGCAA AAATACAGTA A
|
Protein sequence | MTRYPFPKET DKALYRDAVS NDAYYGLPQP VSFCKLCVIS NQRPSSTIEF KNNGTKPKTV IQFSKDEICD ACRAKKQKSA IDWDERAKEL KNLCDRFRKT DGGYDCLIPG SGGKDSFMQA HILKYEYGMN PLTCTWAPNI YTDWGWKNHQ AWIHAGFDNI LFTPNGKVHR LITRLAVENL FHPFQPFILG QKNLGPKIAD LYNINLVFYG ENEAEYGNPM ADMSSALRNW EYFTASNEDE IYLGGASLSE LRELGLKDSD WEIYLPIDPK IISKKQIEVH YLGYYKKWHP QAAYYYAIAH GNFQSSPERT IGTYSTYNSI DDKIDDFHYH TTFIKFGIGR ATYDASQEIR SGDLVREEGV ALVKKFDGEY PERWADEIFK YLSLPMNEFP IASKMFEEPI FNKTYYERLC DKFRSPHLWT WDRDIGWKLR HQVSNNNIDQ KETDLLAWEG NQAKIQ
|
| |