Gene P9303_21441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21441 
Symbol 
ID4777323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1906626 
End bp1908041 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content58% 
IMG OID640087652 
Productputative multidrug efflux transporter, MFS family protein 
Protein accessionYP_001018144 
Protein GI124023837 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAAC ATAACCTCGG CACCACAAGC ATCTTGAGCG GTCCAACTCT AAACAGCCCC 
GAACCGGCCC TACCGACCGG CTCGAATCCA GAAGGACGCC GCGGATTGGC AGCGGTGATC
AAGCTGGATG GCTTTCGCCG GCTGTGGATC GGTCAGATCT TCTCCCAACT GGCCGACAAG
TTCTACATCG TTTTGATGGT GTACCTGATT GCCCAGTACT GGGTGAGCAG TACCCCTCAA
AGCAATGAGG CGCTTGCAGA GGTGGCGGCG GCGATTCGGA TGGATTTCGA AACCCGCGCC
CAAAAGATCA CCCTGCTTGC CACAGGCATT TATGTAGCCA ACACCATTCC AGCAATGTTG
CTAGGAACCG TGGCGGGTGT CTGGGCTGAT CGCTGGCCGA AACGTCGCGT GATGGTGGCC
TCCAACGCGC TGCGTGCTCT GCTGGTGCTC TTGGCCCCAG TTTGTCTCTT GCCAGGTCCC
CATTGGCTGG GTCTCAGCTG GGGGTACTGG GCACTGCTGG TGATGACCTT TCTGGAGTCA
GTTCTCACCC AATTCTTCGC GCCAGCCGAG CAAGCAACCA TCCCTCAACT GGTGCCAAAG
GAACATCTCC TGGCTGCCAA TTCCCTGTAT CAAGCAACCA GCATGGGCGC CACGATTGTG
GGCTTTGCCC TTGGTGATCC AATCTTGCGT CTACTCAAAC AAGCCTTCCT CTCCCTAGGA
CTGAATGGAG GGGAATTTCT GCTCCTACCG TTCTGCTACG GCATGGCAGC GATCAGCCTG
AGCACGATCC AGATGCAGGA GCAACCTCGC GACAGCTCTA GAGAAAGTGT CTGGAAGGAA
ATCGGCGATG GTCTGCAGGT GCTTCGTCAA CAGGCTGCTG TGCGAGGGGC AATGCTCCAC
CTTGTTGTGA TCTATAGCCT GCTAGCCGCC ATGTACGTGC TGGCCATCAC CTTGGCAGGT
TCGATCAAAG GTTTGGGGCC AACTGGTTTC GGCATGATCT TGGCGATGAG TGGGATTGGG
ATGGCCCTGG GGGCTGTGGT GATGGCCCAA ACCGGCCACC GCTTCAACCG CCGCAACTTG
TGCGCAACCG GCCTGGGCAC AATCGCCTGC ACCCTTGTGC TGCTTGGGGG AGTCCTTGGT
TCTCTGGGAC CAACTCTTTT GCTATGCGGC TTGCTGGGCG TGGGAGCAGC CCTTGTAGCG
ATTCCAGCCC AGACCACCAT TCAAGAAGAC ACGCCAGAAG CACAGCGCGG CAAGGTGTTT
GGTCTACAAA ACAACCTCAT CAACATCACC CTGAGTCTGC CTCTTGTGCT GGCTGGCGCT
CTGGTGGGCA GCTATGGGCT CGTGCCAGTG CTGTGGTTGC TTGCTGCACT TGCCCTAGTA
GCAGCTTTAT TAGAGCGCCC TTGGCAGCGC TGCTAG
 
Protein sequence
MGEHNLGTTS ILSGPTLNSP EPALPTGSNP EGRRGLAAVI KLDGFRRLWI GQIFSQLADK 
FYIVLMVYLI AQYWVSSTPQ SNEALAEVAA AIRMDFETRA QKITLLATGI YVANTIPAML
LGTVAGVWAD RWPKRRVMVA SNALRALLVL LAPVCLLPGP HWLGLSWGYW ALLVMTFLES
VLTQFFAPAE QATIPQLVPK EHLLAANSLY QATSMGATIV GFALGDPILR LLKQAFLSLG
LNGGEFLLLP FCYGMAAISL STIQMQEQPR DSSRESVWKE IGDGLQVLRQ QAAVRGAMLH
LVVIYSLLAA MYVLAITLAG SIKGLGPTGF GMILAMSGIG MALGAVVMAQ TGHRFNRRNL
CATGLGTIAC TLVLLGGVLG SLGPTLLLCG LLGVGAALVA IPAQTTIQED TPEAQRGKVF
GLQNNLINIT LSLPLVLAGA LVGSYGLVPV LWLLAALALV AALLERPWQR C