Gene P9303_29931 details

Gene Information

Locus tag: P9303_29931
Symbol:
ID: 4777053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2642502
End bp: 2643809
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 53%
IMG OID: 640088517
Product: multidrug ABC transporter
Protein accession: YP_001018988
Protein GI: 124024681
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
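
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 2642502
END_BP = 2643809


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1308
print(protein_length(length_bp))  # 435
```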


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.910726
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCAA GTCTCTATGG GATCGTGACA TTCATCCTGA CTCATTCCCA ATTGCGCCGA 
CCACAGATTC CCATCTTTCT CTGTGCCTTT GTCACTCTGT TGAACGATCG CCTAGGTGAA
ACCCTCTTAC TGCCATTACT TCCATACCTC CCAGGACGCT TCACAGACAG CGGCACAATC
CTGGGGCTAC TGGGAGGTAC TTATGCATTG GCTCAATTCG TCGTGGCTCC CCTAATTGGC
GCTCTCAGTG ATCGCTTTGG CCGCAAACCA GTTTTAACCG CTTGCGTTGC TGGCTCAGTA
GTAGGCCTTG GCTTATTCGC TATCACAATA TGGATTGATT GGAACATACT TCCAGCCGCT
TGGATCGGCA TTGTTCCCCT AATCCTTCTC TTCTCAGCAA GAATCATCGA TGGCGTTAGC
GGTGGCACAG CAAGTAGCGC CACCGCAGTG CTTGCCGACA TCTCAACACC TGAGAACCGA
GCAAAGGCAT TCGGCCTTAT TGGCGTTGCA TTCGGCCTCG GGTTCATCTT GGGCCCTTAT
ATCGGTGGCC GCTTAGCAGA GATCAATATC GCTCTACCCG GTATAGCCGC CACAGCCTTC
GCAGTCGCGA ACCTGCTCCT TGTGATCTAT ATCCTTCCCG AAACACACCC GCCAGCAGCT
CGCAATTCCC TACCAAGCAA AAGACAACTC AACCCGATCA CCCAGCTAGC ACAGATCTTT
GCCAATCCAC TAGTAAGCCG CCTTTGTTTC GCCTTCTTCC TGTTCTTCAT GGCATTCAAC
GGTTTCACAG CTGTGCTGGT GCTTTACCTG AAGCAAGCCT TCTCATGGAC AGTCAGTTTG
GCAGGCCTGA CCTTTGCAGT TGTAGGCGTG ATCGCAATGG TGGTTCAAGG GCTGCTAATC
GGTCCACTGG TCAAATCCTT CGGCGAATGG CGGCTCACCA TTGCTGGCAT TGGCTTCGTC
ATTGCAGGCT GTCTGCTATT GCCCATGGCC ACTCAGCAGA ATTCGATTTC TGTTGTATTC
ACTGCCGTAT CGGTACTAGC CCTTGGCACA GGTCTAGTCG TGCCATGCTT GAGAGCGCTT
GTCTCCAGAC GCCTCGACAA CGCTGGCCAA GGGGCAGTAC TCGGTAGCCT TCAAGGTCTT
CAGAGTCTAG GGACCTTCCT TGGTGCAGCC GCCGCAGGAT TCGCCTACGA CCAAATAGGC
ACTCGCAGTC CCTTCTGGCT GGCCAGCCTC GTACTAGTGG GAGTGATTGC CCTTGTTGCA
GGAGGCCTGC CTGCAAGCAC AAGGAACACA ACAATCAAAC AATCATGA
 
Protein sequence
MASSLYGIVT FILTHSQLRR PQIPIFLCAF VTLLNDRLGE TLLLPLLPYL PGRFTDSGTI 
LGLLGGTYAL AQFVVAPLIG ALSDRFGRKP VLTACVAGSV VGLGLFAITI WIDWNILPAA
WIGIVPLILL FSARIIDGVS GGTASSATAV LADISTPENR AKAFGLIGVA FGLGFILGPY
IGGRLAEINI ALPGIAATAF AVANLLLVIY ILPETHPPAA RNSLPSKRQL NPITQLAQIF
ANPLVSRLCF AFFLFFMAFN GFTAVLVLYL KQAFSWTVSL AGLTFAVVGV IAMVVQGLLI
GPLVKSFGEW RLTIAGIGFV IAGCLLLPMA TQQNSISVVF TAVSVLALGT GLVVPCLRAL
VSRRLDNAGQ GAVLGSLQGL QSLGTFLGAA AAGFAYDQIG TRSPFWLASL VLVGVIALVA
GGLPASTRNT TIKQS
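
The protein sequence can be recomputed from the gene sequence as a sanity check. The sketch below is a minimal, dependency-free example; the helper names are hypothetical. It uses the standard codon assignments, which translation table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 allows, which is harmless here because this gene starts with ATG. It also assumes the input contains only A, C, G and T once whitespace is removed.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in the conventional T, C, A, G ordering.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGCGTCAA GTCTCTATGG GATCGTGACA"
print(translate(first_blocks))  # MASSLYGIVT
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the listed protein sequence and the GC content field.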