Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_19521 |
Symbol | |
ID | 4778374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1716248 |
End bp | 1717690 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087462 |
Product | hypothetical protein |
Protein accession | YP_001017959 |
Protein GI | 124023652 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAGTTAG CAAATAAGCA GCTTTTGGAT CCCTCTAATC AGCTTTCTGA TCAGAGAAAT AAGAGACGAC GTTTACGTTT CATTCTTCAG ATCGCAATTT CAGCAGCCTT AGGAGGGTTT CTCTTTGGCT ACGACACGGC AGTGATCAAT GGAGCTGTCG GTGCGATCGG TACCGCCTTC ACCGTCTCCA AGGAAACCCT CGGCTTTGCT GTGGCCTCAG CTTTGCTGGG TTCCGCATTG GGAGCGTTCA CCGCTGGCTG GCTGTCTGAT CGAATCGGTC GTCGCAACAG CATGCTGGTT GCTGCACTGA TGTTTCTTGT TGGTTCCCTC GGTTCTGCTC TTGCTCCAAC GATCACCACC CTGATCCTCT GGCGGGTCGT TGGTGGTCTG GCCGTCGGTT TCGCCAGTGT GTTGGCGCCC GCTTATATCG CCGAGATCTC TCCTGCATCG ATGCGTGGAC AGCTTGGCTC ACTACAGCAG CTGGCGATTG TTATCGGTAT TTTCCTGGCG TTGCTGTTCG ATTACGTCAT CGTTCTTTTG ACTGCTGATC AGAATCCCGT TTCATTGATC GGTCCTCTAG CGGCCTGGCG CTGGATGTTC ATGTCTGAAA TCATCCCTGC AGCTCTTTAC GCAGTACTGG TGATCGGCAT TCCAGAGAGT CCTCGCTATC TCGTGCAGAA AGGTTTGACG CAGCGTGCCA AGGCGGTGAT TGAAAAAACG CTGCATGAAC CTGCAGATCA GGTGATCGCC AGGATCCAGA GCAGCCTGGT TAACACCCAT CAAGGCAAGT TAAGTGAACT GTTCGATCGC CACACCATCC TGCTGCCGAT CATCTGGACT GGCGTGATGC TGGCGATCTT CCAGCAGTTT GTGGGCATCA ATGTGATCTT CTATTACTCC AGCGTCCTGT GGCAGGCCGT TGGTTTCAGC GCCAAGGACA GTCTGATTGT CACGGTGATC ACCTCGATCA CAAATGTCGT CACCACCTTC ATTGCGATTG CATTTATTGA TCGTCTCGGC CGCAAACCCC TGCTTTTGGC TGGTTCGGTT GTGATGGCGG TGAACCTCGG TGTGATGAGC TGGGCCTTTG CCGGTGCCCC TCTCGTCAAC GGTGCGCCCC ACCTTGCCGG GGCAGGAGCC ATTGTGGCTT TGATTGCCGC CAATCTGTTT GTGTTCGCCT TCGGTTTCTC CTGGGGACCG GTGATGTGGG TGATGCTCGG AGAAATGTTC AACAACCGCA TCCGTGCGGT GGCAATCGGG CTGTGCGCCA TGGTCAATTG GATTGCCAAT TTCTTAATTT CCGACACCTT CCCTGGCTTG CTGGAACGCT CGGGACCTGC ACTCGCCTAC GGCCTGTACG CCACGGCTGC TGCGATCTCG TTCTTCCTCG TGCTGTTTTT CGTCAGGGAG ACCAAAGGCA TGGAGCTCGA GGAGATGGCC TGA
|
Protein sequence | MQLANKQLLD PSNQLSDQRN KRRRLRFILQ IAISAALGGF LFGYDTAVIN GAVGAIGTAF TVSKETLGFA VASALLGSAL GAFTAGWLSD RIGRRNSMLV AALMFLVGSL GSALAPTITT LILWRVVGGL AVGFASVLAP AYIAEISPAS MRGQLGSLQQ LAIVIGIFLA LLFDYVIVLL TADQNPVSLI GPLAAWRWMF MSEIIPAALY AVLVIGIPES PRYLVQKGLT QRAKAVIEKT LHEPADQVIA RIQSSLVNTH QGKLSELFDR HTILLPIIWT GVMLAIFQQF VGINVIFYYS SVLWQAVGFS AKDSLIVTVI TSITNVVTTF IAIAFIDRLG RKPLLLAGSV VMAVNLGVMS WAFAGAPLVN GAPHLAGAGA IVALIAANLF VFAFGFSWGP VMWVMLGEMF NNRIRAVAIG LCAMVNWIAN FLISDTFPGL LERSGPALAY GLYATAAAIS FFLVLFFVRE TKGMELEEMA
|
| |