Gene Information |
Locus tag | P9303_28051 |
Symbol | |
ID | 4778568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2472275 |
End bp | 2473960 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640088328 |
Product | ABC transporter ATP-binding protein |
Protein accession | YP_001018800 |
Protein GI | 124024493 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
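The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2472275, 2473960)
print(length, protein_length(length))  # prints: 1686 561
```

Both results agree with the Gene Length (1686 bp) and Protein Length (561 aa) fields.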
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.779862 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTTGA GCATCCGACC CAGCGGATCA ATCCATGGTA TTCAGTCTCA AGACTCCGTG CTACTAACGA TGCCCAGCTC TTCAGAGCAG GTACTCGAGC TCAGCCAACT GCGCATGCGC TATCCCCGCA GCGCTGATTG GACACTGGAT GGACTGAACC TAAGCATCAA GTCAGGAGAG AGACTCGCCC TGGTGGGCCC TTCAGGCTGC GGCAAGAGCA CCGTGGCCAA GGCCGTACTG CATCTACTTC CCCCTGGCAG CATTTGCCAG GGGGGGGTGC TACTGACCGG CCAAGATCCA AGACGATTGC AACAGAAGCG ATTGCGGAAA CTGCGTGGTG AAGCGGTAGG TCTGGTTTTT CAGGATCCAA TGACCCGCCT TAATCCGTTG ATGACGGTGG GAAAGCATTT GCTTGACACT CTCAACGCCC ACCAACCAGA AGCTACCCCC AGCTGGCGAG AGCAGCGGGC AGAGGAACTC CTGGAGCGAG TGGGAATCGG GGCGAACCGG TTCCGCGCTT ACCCCCATGA ATTCAGTGGT GGCATGCGTC AGCGGCTGTC CATTGCTCTG GCAATCGCTC TCAATCCTCC TCTGGTCATC GCCGATGAGC CCACCACCAG CCTTGATGTC GCCGTGGCTG GTCAGGTTAT GGCCGAACTC AGCAACCTCT GCGAGGAGCT CGGCAGTGCT CTCATGTTGA TCAGCCATGA CTTGGCGATG GCAGCTCGCT GGTGCGAGCG CATGGCCATT CTCGATGGGG GACGCATGGT GGAAGAGGGC CGGAGCGAAG AGCTGCTTTC TCATCCCCGC TCAGCTATCG GCACTCGGCT CGTGGGTGCC GCCAGGGCGC GTGAAGGCGG AAGCACTCCA ACTTGCTCGC ATACCGCTGC CGTCGTTCTA GAAGTCAATG CTTTGCGCTG TTGGCATGCT CTGGGAGGCT GGCCCTGGGC ACCGAGCTGG CTCAAAGCTG TTGATGGAGT GAGCTTCAAC ATCCGTGCTG GCGAGAGCCT TGGTGTGGTT GGAGCATCTG GCTGCGGCAA AAGCACCCTC TGCCGGGCCC TCATGGGTCT TACGCCGATC CGGGGTGGTC AGGTGCATCT CCAAGGCCAC AACCTGCTCA GCCTGCAAGG ACAGCCATTA AGGCAAGCCC GTCAGGCCTT GCAGATGGTT TTTCAAGACC CCCTCGCCTG CCTCAATCCA AAAATGACGA TCGCAGAAGC CATCGCCGAC CCACTGCTCA TCCATGGAAT GGCAAGCCGA GCCGAAGCAA GGCAAAACGG GCGAAAGTTA CTTGAGCAGG TGGGCCTCAG CCCAGCGGAG GACTATCAAA ACCGACTGCC TCGCCAGCTC TCTGGCGGCC AGCAACAACG CGTCGCCATT GCCCGAGCCT TGGCCTTAAA ACCGAAGGTG CTGATTTGCG ATGAGAGCGT CAGCATGCTG GATGCGGAAA TTCAAGCTGA GGTGTTGGCT CTATTGCGTC AGCTTCAGAA AGAATTGGGG CTCGCGATGC TGTTCATTAC CCACGACCTC TCCGTTGCCA GTGGTTTCTG CCACAGAGTG ATCGTGCTCG ACCATGGGCA AATCGTTGAG GAAGGACCCG GCGATGAGCT CCTTCAGCAC CCCCAAGCAG CGATTACCCG CACACTCGTA GAAGCCTGTC CGAGACTGCC AACAACAATG CCTTAA
|
Protein sequence | MTLSIRPSGS IHGIQSQDSV LLTMPSSSEQ VLELSQLRMR YPRSADWTLD GLNLSIKSGE RLALVGPSGC GKSTVAKAVL HLLPPGSICQ GGVLLTGQDP RRLQQKRLRK LRGEAVGLVF QDPMTRLNPL MTVGKHLLDT LNAHQPEATP SWREQRAEEL LERVGIGANR FRAYPHEFSG GMRQRLSIAL AIALNPPLVI ADEPTTSLDV AVAGQVMAEL SNLCEELGSA LMLISHDLAM AARWCERMAI LDGGRMVEEG RSEELLSHPR SAIGTRLVGA ARAREGGSTP TCSHTAAVVL EVNALRCWHA LGGWPWAPSW LKAVDGVSFN IRAGESLGVV GASGCGKSTL CRALMGLTPI RGGQVHLQGH NLLSLQGQPL RQARQALQMV FQDPLACLNP KMTIAEAIAD PLLIHGMASR AEARQNGRKL LEQVGLSPAE DYQNRLPRQL SGGQQQRVAI ARALALKPKV LICDESVSML DAEIQAEVLA LLRQLQKELG LAMLFITHDL SVASGFCHRV IVLDHGQIVE EGPGDELLQH PQAAITRTLV EACPRLPTTM P
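The gene sequence as displayed begins with GTG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to check the protein sequence and GC content against it. It assumes the standard codon assignments plus the alternative start codons listed for NCBI translation table 11 (which is why the leading GTG is read as M rather than V); `translate_cds` and `gc_content` are hypothetical helper names, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11; a start codon in the first
# position is translated as methionine (assumption stated in the lead-in).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGACGTTGA GCATCCGACC CAGCGGATCA"))  # prints: MTLSIRPSGS
```

The ten residues produced from the first 30 bases match the start of the protein sequence above. Passing the full gene sequence to `translate_cds` and `gc_content` would allow the remaining residues and the GC content field to be checked the same way.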