Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21441 |
Symbol | |
ID | 4777323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1906626 |
End bp | 1908041 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640087652 |
Product | putative multidrug efflux transporter, MFS family protein |
Protein accession | YP_001018144 |
Protein GI | 124023837 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAAC ATAACCTCGG CACCACAAGC ATCTTGAGCG GTCCAACTCT AAACAGCCCC GAACCGGCCC TACCGACCGG CTCGAATCCA GAAGGACGCC GCGGATTGGC AGCGGTGATC AAGCTGGATG GCTTTCGCCG GCTGTGGATC GGTCAGATCT TCTCCCAACT GGCCGACAAG TTCTACATCG TTTTGATGGT GTACCTGATT GCCCAGTACT GGGTGAGCAG TACCCCTCAA AGCAATGAGG CGCTTGCAGA GGTGGCGGCG GCGATTCGGA TGGATTTCGA AACCCGCGCC CAAAAGATCA CCCTGCTTGC CACAGGCATT TATGTAGCCA ACACCATTCC AGCAATGTTG CTAGGAACCG TGGCGGGTGT CTGGGCTGAT CGCTGGCCGA AACGTCGCGT GATGGTGGCC TCCAACGCGC TGCGTGCTCT GCTGGTGCTC TTGGCCCCAG TTTGTCTCTT GCCAGGTCCC CATTGGCTGG GTCTCAGCTG GGGGTACTGG GCACTGCTGG TGATGACCTT TCTGGAGTCA GTTCTCACCC AATTCTTCGC GCCAGCCGAG CAAGCAACCA TCCCTCAACT GGTGCCAAAG GAACATCTCC TGGCTGCCAA TTCCCTGTAT CAAGCAACCA GCATGGGCGC CACGATTGTG GGCTTTGCCC TTGGTGATCC AATCTTGCGT CTACTCAAAC AAGCCTTCCT CTCCCTAGGA CTGAATGGAG GGGAATTTCT GCTCCTACCG TTCTGCTACG GCATGGCAGC GATCAGCCTG AGCACGATCC AGATGCAGGA GCAACCTCGC GACAGCTCTA GAGAAAGTGT CTGGAAGGAA ATCGGCGATG GTCTGCAGGT GCTTCGTCAA CAGGCTGCTG TGCGAGGGGC AATGCTCCAC CTTGTTGTGA TCTATAGCCT GCTAGCCGCC ATGTACGTGC TGGCCATCAC CTTGGCAGGT TCGATCAAAG GTTTGGGGCC AACTGGTTTC GGCATGATCT TGGCGATGAG TGGGATTGGG ATGGCCCTGG GGGCTGTGGT GATGGCCCAA ACCGGCCACC GCTTCAACCG CCGCAACTTG TGCGCAACCG GCCTGGGCAC AATCGCCTGC ACCCTTGTGC TGCTTGGGGG AGTCCTTGGT TCTCTGGGAC CAACTCTTTT GCTATGCGGC TTGCTGGGCG TGGGAGCAGC CCTTGTAGCG ATTCCAGCCC AGACCACCAT TCAAGAAGAC ACGCCAGAAG CACAGCGCGG CAAGGTGTTT GGTCTACAAA ACAACCTCAT CAACATCACC CTGAGTCTGC CTCTTGTGCT GGCTGGCGCT CTGGTGGGCA GCTATGGGCT CGTGCCAGTG CTGTGGTTGC TTGCTGCACT TGCCCTAGTA GCAGCTTTAT TAGAGCGCCC TTGGCAGCGC TGCTAG
|
Protein sequence | MGEHNLGTTS ILSGPTLNSP EPALPTGSNP EGRRGLAAVI KLDGFRRLWI GQIFSQLADK FYIVLMVYLI AQYWVSSTPQ SNEALAEVAA AIRMDFETRA QKITLLATGI YVANTIPAML LGTVAGVWAD RWPKRRVMVA SNALRALLVL LAPVCLLPGP HWLGLSWGYW ALLVMTFLES VLTQFFAPAE QATIPQLVPK EHLLAANSLY QATSMGATIV GFALGDPILR LLKQAFLSLG LNGGEFLLLP FCYGMAAISL STIQMQEQPR DSSRESVWKE IGDGLQVLRQ QAAVRGAMLH LVVIYSLLAA MYVLAITLAG SIKGLGPTGF GMILAMSGIG MALGAVVMAQ TGHRFNRRNL CATGLGTIAC TLVLLGGVLG SLGPTLLLCG LLGVGAALVA IPAQTTIQED TPEAQRGKVF GLQNNLINIT LSLPLVLAGA LVGSYGLVPV LWLLAALALV AALLERPWQR C
|
| |