Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28591 |
Symbol | |
ID | 4778307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2528631 |
End bp | 2530943 |
Gene Length | 2313 bp |
Protein Length | 770 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640088382 |
Product | hypothetical protein |
Protein accession | YP_001018854 |
Protein GI | 124024547 |
COG category | [V] Defense mechanisms |
COG ID | [COG1131] ABC-type multidrug transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTTG TTCTTTTTAG TCGTTGTCGC AACGAAGAGA CCAATGTCTC TTTGCCTGTT GGTGAGTTGA GCATTGGGCG TTCGCCTGAT GCAGACATCA ACTTGCCAAC AGATTGGACC TTGCTTTCCG CTCTACATCT GAGCTTGCAG GTCACCAGTG ATGGAAGCCT GGCTGTTCGT GATGGGGTGG CAGGCAAGCC AAGTACCAAT GGAACCCAGC TCAACTGCGG TTATCTCTCA GCTGATCGCT GGACACCCTT GCGACTTGGC GATGAACTGC AAATTGGCAG TCAAATTAAA GATGCAGTGC GACTGGCTGT CCTTTCAGAT GCCACTAACA CCAGTCAAGG TTCATCAGCT GAGCAGCAGC GCCAATGGCA GTTAGAGGGC AGACGCCTAA CGATGGGCCG TGGCCTGGAC TGTGACGTCA AATTGAGCGG TCCAACGATA TCGAGATTGC ATTGCTCAAT TAATCGCTCT GGCAATGACA TCGTTCTTCT TGATCAAAGC CGTAACGGCA TCTTTGTGAA TGATCGGCCG GTCAATCGCC AAGTGCGATT AAGGGATGGC GATCAGATCA AGGTTGGAAC CTCAGTCTTT GTATGGAGTA CCCCTTGGCT CAGTCGCCAA ACCAGCGGCA AAAGCTATCG AATTGATGTG CGCGACCTAT GGCTCAAGGG TCGAATCAGT GGCAGCAACC TTTCTATTGA GCCCGGGCAG CTTGTTGCCT TTGTTGGCGG CAGTGGAGCA GGTAAATCAA GCCTGCTAAC CACCATCGTT GGCCAAAACC TTGACTATCA AGGTCAGATT TTGGTCAACG GCAATGAGTT GCGGGAGACC TATGGCGCCA TCAAGCAGGA AATTGGCTTC GTTCCACAAG ACGACATTGT TCATCTTGAT CTCACTGTAG AAGAGGTCTT GCGCTATTCC GCGCGACTGA AGCTCCCAGA CGTGGACGAG CAACGCGCAG CCGTTGAACG TGTGTTGGAT GAATTGGAGA TCAGCCATCG CCGCAAGGCA TTGGTGCGAG AACTAAGCGG TGGACAACGC AAGCGGGTGA GCATTGGCGT CGAGTTGATA GCCGACCCTC GGATCCTGTT TCTTGATGAG CCCACATCCG GCCTCGATCC TGGCCTTGAC AAACGGATGA TGGAATTACT GAGAAGCCTT GCAAATTCAG GGCGAACCGT GGCTTTGGTA ACCCATGCCA CCAACAACGT GATGCTTTGT GATCAGGTTG TTTTCCTCGC TCGAGGTGGA CAGCTTTGCT ACGCCGGTCC ACCTAGCCAG TGCCTTGATC ATTTCCAGTT AACTGGCGAT TTTTCTGATA TCTATCAGTA CCTAGAACGC ACTGATCAAG AGATTGCCTC AATCGCAGAT AGCTATCGAG CAGAGATCTT AAAAGTCTTG CCGAAGGTTT CCAGTCAATC TGGTTCGACA TCACATTCAC TCGAAACCAA ACGAGCAGGC CGGCTTGGAC TTGTGATTCA ACAGTTCCGC ACACTGCTCT CTCGTGATGC CATCCTTACG TTTAGAGACA GCACCTCACT GGTTCTTAAT GCAGTGACAG CTCCGTTGGC TGTTCTGATG ATTGCTTTTG CTGCAAATAA CAGGCAGATA TTCTCAGACC TCGATGCCAT TGACGCCAGC ACATATCCCG ATGCATTGAG AGTCCTATTT GTAATTATTT GCGCCACGAT CTGGGTTGGA CTTTCAACAT CGCTGCAGTC GCTTGTAAAA GATCGAGGAA TTTTCTTGCG GGAGAGGTCA TTCAATCTGC TGCCGGAGTC TTACCTGAGC GCAAAAATCA TTGTGATGTT TTTTCAGGCT GTTGTTCAGT CTCTATTGAT CCTGGCCACT GTCAAGATCT TCTTTGACTC ACCAGACACA ACATTCCTCA ATTGGCCATT ATCAATCGCC ATGGTTTGTT TTACAACCCT GATCACGATT GGATCGCAGG CGTTGATGAC ATCAAGCCTG GTCAAGAACA GTCAGCAAGC CAGCAGCATC GCTCCACTGC TATTAATACC TCAGCTCATC TTTGGCGGAG TGTTGTTTAC GCTCAGCAAA ACCGCCGATG ATATTTACCC ATTAATAACA AGCCGCTGGG CAATGAAAGC AATGGGAATT TACTCTGATA TCACAGAGTT GATTCCAGGT GGCCAGACAG CTATCAACCA GATGCCTGGC GCTAGTTCCT ACGAAGCAAC TTTAAGCAAC TTGCATAGCT CATTCCTGCT TATGGCTATT CAATGCGCTG CTTTTTTAAT GCTAACACTG GGTTCTCTAC TGTTTTTAAA ACATAATCGC TAA
|
Protein sequence | MSVVLFSRCR NEETNVSLPV GELSIGRSPD ADINLPTDWT LLSALHLSLQ VTSDGSLAVR DGVAGKPSTN GTQLNCGYLS ADRWTPLRLG DELQIGSQIK DAVRLAVLSD ATNTSQGSSA EQQRQWQLEG RRLTMGRGLD CDVKLSGPTI SRLHCSINRS GNDIVLLDQS RNGIFVNDRP VNRQVRLRDG DQIKVGTSVF VWSTPWLSRQ TSGKSYRIDV RDLWLKGRIS GSNLSIEPGQ LVAFVGGSGA GKSSLLTTIV GQNLDYQGQI LVNGNELRET YGAIKQEIGF VPQDDIVHLD LTVEEVLRYS ARLKLPDVDE QRAAVERVLD ELEISHRRKA LVRELSGGQR KRVSIGVELI ADPRILFLDE PTSGLDPGLD KRMMELLRSL ANSGRTVALV THATNNVMLC DQVVFLARGG QLCYAGPPSQ CLDHFQLTGD FSDIYQYLER TDQEIASIAD SYRAEILKVL PKVSSQSGST SHSLETKRAG RLGLVIQQFR TLLSRDAILT FRDSTSLVLN AVTAPLAVLM IAFAANNRQI FSDLDAIDAS TYPDALRVLF VIICATIWVG LSTSLQSLVK DRGIFLRERS FNLLPESYLS AKIIVMFFQA VVQSLLILAT VKIFFDSPDT TFLNWPLSIA MVCFTTLITI GSQALMTSSL VKNSQQASSI APLLLIPQLI FGGVLFTLSK TADDIYPLIT SRWAMKAMGI YSDITELIPG GQTAINQMPG ASSYEATLSN LHSSFLLMAI QCAAFLMLTL GSLLFLKHNR
|
| |