Gene P9303_19521 details


Gene Information

Locus tag: P9303_19521
Symbol:
ID: 4778374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1716248
End bp: 1717690
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 55%
IMG OID: 640087462
Product: hypothetical protein
Protein accession: YP_001017959
Protein GI: 124023652
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
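
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1716248, 1717690)
print(length)                     # 1443
print(protein_length_aa(length))  # 480
```

Both values agree with the Gene Length and Protein Length fields of this record.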


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAGTTAG CAAATAAGCA GCTTTTGGAT CCCTCTAATC AGCTTTCTGA TCAGAGAAAT 
AAGAGACGAC GTTTACGTTT CATTCTTCAG ATCGCAATTT CAGCAGCCTT AGGAGGGTTT
CTCTTTGGCT ACGACACGGC AGTGATCAAT GGAGCTGTCG GTGCGATCGG TACCGCCTTC
ACCGTCTCCA AGGAAACCCT CGGCTTTGCT GTGGCCTCAG CTTTGCTGGG TTCCGCATTG
GGAGCGTTCA CCGCTGGCTG GCTGTCTGAT CGAATCGGTC GTCGCAACAG CATGCTGGTT
GCTGCACTGA TGTTTCTTGT TGGTTCCCTC GGTTCTGCTC TTGCTCCAAC GATCACCACC
CTGATCCTCT GGCGGGTCGT TGGTGGTCTG GCCGTCGGTT TCGCCAGTGT GTTGGCGCCC
GCTTATATCG CCGAGATCTC TCCTGCATCG ATGCGTGGAC AGCTTGGCTC ACTACAGCAG
CTGGCGATTG TTATCGGTAT TTTCCTGGCG TTGCTGTTCG ATTACGTCAT CGTTCTTTTG
ACTGCTGATC AGAATCCCGT TTCATTGATC GGTCCTCTAG CGGCCTGGCG CTGGATGTTC
ATGTCTGAAA TCATCCCTGC AGCTCTTTAC GCAGTACTGG TGATCGGCAT TCCAGAGAGT
CCTCGCTATC TCGTGCAGAA AGGTTTGACG CAGCGTGCCA AGGCGGTGAT TGAAAAAACG
CTGCATGAAC CTGCAGATCA GGTGATCGCC AGGATCCAGA GCAGCCTGGT TAACACCCAT
CAAGGCAAGT TAAGTGAACT GTTCGATCGC CACACCATCC TGCTGCCGAT CATCTGGACT
GGCGTGATGC TGGCGATCTT CCAGCAGTTT GTGGGCATCA ATGTGATCTT CTATTACTCC
AGCGTCCTGT GGCAGGCCGT TGGTTTCAGC GCCAAGGACA GTCTGATTGT CACGGTGATC
ACCTCGATCA CAAATGTCGT CACCACCTTC ATTGCGATTG CATTTATTGA TCGTCTCGGC
CGCAAACCCC TGCTTTTGGC TGGTTCGGTT GTGATGGCGG TGAACCTCGG TGTGATGAGC
TGGGCCTTTG CCGGTGCCCC TCTCGTCAAC GGTGCGCCCC ACCTTGCCGG GGCAGGAGCC
ATTGTGGCTT TGATTGCCGC CAATCTGTTT GTGTTCGCCT TCGGTTTCTC CTGGGGACCG
GTGATGTGGG TGATGCTCGG AGAAATGTTC AACAACCGCA TCCGTGCGGT GGCAATCGGG
CTGTGCGCCA TGGTCAATTG GATTGCCAAT TTCTTAATTT CCGACACCTT CCCTGGCTTG
CTGGAACGCT CGGGACCTGC ACTCGCCTAC GGCCTGTACG CCACGGCTGC TGCGATCTCG
TTCTTCCTCG TGCTGTTTTT CGTCAGGGAG ACCAAAGGCA TGGAGCTCGA GGAGATGGCC
TGA
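
The GC content field reported for this gene can be recomputed from the sequence text. The sketch below is one possible way to do it, assuming the sequence is supplied as plain text in the 10-base block layout used above. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    Returns 0.0 for an empty sequence.
    """
    dna = "".join(seq.split()).upper()
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# Usage: paste the full gene sequence into the string below.
example = "TTGCAGTTAG CAAATAAGCA"  # first two blocks of the gene, for illustration
print(f"{gc_content(example):.0%}")
```

Running it on the full 1443 bp sequence rather than the two-block example gives the value to compare with the GC content field.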
 
Protein sequence
MQLANKQLLD PSNQLSDQRN KRRRLRFILQ IAISAALGGF LFGYDTAVIN GAVGAIGTAF 
TVSKETLGFA VASALLGSAL GAFTAGWLSD RIGRRNSMLV AALMFLVGSL GSALAPTITT
LILWRVVGGL AVGFASVLAP AYIAEISPAS MRGQLGSLQQ LAIVIGIFLA LLFDYVIVLL
TADQNPVSLI GPLAAWRWMF MSEIIPAALY AVLVIGIPES PRYLVQKGLT QRAKAVIEKT
LHEPADQVIA RIQSSLVNTH QGKLSELFDR HTILLPIIWT GVMLAIFQQF VGINVIFYYS
SVLWQAVGFS AKDSLIVTVI TSITNVVTTF IAIAFIDRLG RKPLLLAGSV VMAVNLGVMS
WAFAGAPLVN GAPHLAGAGA IVALIAANLF VFAFGFSWGP VMWVMLGEMF NNRIRAVAIG
LCAMVNWIAN FLISDTFPGL LERSGPALAY GLYATAAAIS FFLVLFFVRE TKGMELEEMA
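
The protein sequence follows from the gene sequence under translation table 11. Note that the gene starts with TTG, which codes for leucine internally but is read as methionine when used as an initiation codon. The sketch below illustrates this under stated assumptions: the codon table is the standard one shared by NCBI tables 1 and 11, the start codon set is the one listed for table 11, and the `translate` helper is hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); table 11 uses
# the same amino acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of translation table 11 (assumed from the NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    A recognized start codon in the first position is translated as M.
    Whitespace between sequence blocks is ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("TTGCAGTTAG CAAATAAGCA GCTTTTGGAT"))  # MQLANKQLLD
```

The output matches the first block of the protein sequence above.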