Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21341 |
Symbol | |
ID | 4775989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1897690 |
End bp | 1898718 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640087642 |
Product | hypothetical protein |
Protein accession | YP_001018134 |
Protein GI | 124023827 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.551591 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCGA AACCTTTGAT ATTTGCCCAA TCACTTGTTC TTAGTGGCCC TGCACGTGAC CTGGGTAGGA ATTTTATAAA AGGCATTGAC CTTTATCTTA AAAAGGTCAA TGATCAAGGT GGTATAGATG GTCGCCCGAT CATGATATGG AGATTAGACG ATGGCTATGA ACCAGAAAAT GCTTATAATA ATACAAGTCA ATTTGTTAGT TACTCGCAAC TTCTTGGATT ATTTGGTTAT ATCGGCACAC CAACGACAAA AGCTTCACTC CCACTAGCTA AGGCTGCTGA AATTGATATT ATCTCACCAT TCACTGGAGC AAGTGTATTG CGTGGAGAAA ATAATGCCTA CACAATTCAT CATCGCGCTA GTTATGCGGA TGAAGCGAAA AGGATTGTTG ATTATTTGGT TAATGATGGA TTTGTTCGCA TCGCAATAGG ATATCAAAAT GATTCTTATG GCAAAGATGT ATTAAATAGT CTTATAGAGG AATTAGCTGA CCCTCATATG CTATCGCCAG TGATAAGTGT TCCTCTCGTG AGAAATTCAA GAGACACGGG TAATGCAGCA AAAGAAATTA AAGCTCATAA TCCCGATGCT CTTATTGCGA TTTCAACTTA CCAGACAGTC GCCAGTCTTA TTCAGAATCT TAATTCACAA GGTAGCTACC CTCAAGTTAT GACAATTTCA TTCACAGGTA CTAAGTCACT AATCAAAGAG TTGCCTCGTC ATACTTCATT TGGGATAGGC GTTACACAAG TCGTTCCTTT CCCATGGGAC CTTCGCAATC CCATAGTGCG AGACTATCAA CATGATCTAC GTCATGTTGA TTCAGATGCA GAATTTGATT TTGTGAGCTT GGAAGGCTAT CTGATCGCGA GGAAGCTTGT ATCGGCCTTA CGAAAAGCTT CACCTGTTAT CAATCGATCA AGCCTGAGAG ACGCATTGCT GGAGGAAAAT GATGGGCTTA TTTCTGGTGG TGTAGAAGTA GACTTAGTAT TTCTTGGCAC CGATCCATGG CAACCATGA
|
Protein sequence | MDPKPLIFAQ SLVLSGPARD LGRNFIKGID LYLKKVNDQG GIDGRPIMIW RLDDGYEPEN AYNNTSQFVS YSQLLGLFGY IGTPTTKASL PLAKAAEIDI ISPFTGASVL RGENNAYTIH HRASYADEAK RIVDYLVNDG FVRIAIGYQN DSYGKDVLNS LIEELADPHM LSPVISVPLV RNSRDTGNAA KEIKAHNPDA LIAISTYQTV ASLIQNLNSQ GSYPQVMTIS FTGTKSLIKE LPRHTSFGIG VTQVVPFPWD LRNPIVRDYQ HDLRHVDSDA EFDFVSLEGY LIARKLVSAL RKASPVINRS SLRDALLEEN DGLISGGVEV DLVFLGTDPW QP
|
| |