Gene P9303_16801 details


Gene Information

Locus tag: P9303_16801
Symbol:
ID: 4776484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1468635
End bp: 1469846
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 47%
IMG OID: 640087189
Product: putative permease
Protein accession: YP_001017689
Protein GI: 124023382
COG category: [R] General function prediction only
COG ID: [COG0795] Predicted permeases
TIGRFAM ID:
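
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function name feature_length is illustrative only.

```python
# Coordinates copied from the Gene Information fields above.
# Assumption: positions are 1-based and inclusive on replicon NC_008820.
start_bp = 1468635
end_bp = 1469846


def feature_length(start, end):
    """Length in bp of a feature with inclusive start and end positions."""
    return end - start + 1


gene_length = feature_length(start_bp, end_bp)

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: exactly one terminal stop codon, which is not counted
# in the protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1212 0 403
```

Under these assumptions the result agrees with the reported Gene Length (1212 bp) and Protein Length (403 aa).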


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.636863
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCAAAT GGCGCCTTGG AAAGTTGAAA GACGTTGATT CTTGGCCTTG GAAGTGGTGG 
CGCTGGGCTT CGTCCAAGTG GAGGCTGATT CCGCTCCTGG ATCGATGGCT TTTAGGTGAG
TTAATACCCG TATTGCTGTT TGCCATAGCG GCCTTCACGG TGGTTTCCGT CTCAGTGGGG
GTGATGTTTG ATTTGGTGCG CAAGATTGTT GAGTTTGGAT TGCCTCTGCA TTTTGCATTG
CAGGTTTTTA GCTTGAAGTT ACCTTTCTTT CTGGTGATTT CTTTTCCGAT GGCCACCTTG
TTGGCCACCT TGTTGGCTTA TAGCCGTCTT TCTTCTAACA GCGAATTCAC GGCTTTGCGC
AGCCTCGGTG TAAGCACCAG ACGAATCGTT GCTCCAGCAT TGGCATTGGC ATTATTGATG
ACAGGGCTAA CTTTTATCTT CAATGATGTC ATTGTTCCAA GAACAAATAG TAGTGCTGAG
GTGACGATAA AGCGTGCTCT TGGCAAAGCA ATTGCTACAG AGAAAGGCAA ACATGTTGTT
TATTCCCGAT TCGGAACGAT TACTGGTACA AAAGCTGAAG ATAGGAATCA AGGGCTTTCT
CAGTTGTTTT ATGCAAGGGA ATTCCTTAAG GGAGAGATGG AGGATGTCAC AGTGCTTGAT
TTGTCTCGTC TTGGTTTTAC TCAGATGTTG AAGGCCGAGC GGGCTATTTG GAATGAGCAG
GAGGCGATGT GGGAATTTTT TAACGGTAAT ATCCTTACTC TCACACCAAG TGGCAGTACC
ACTTCAGTTG AATTTGATCG CTATCTCTAC CCCCTAACAT CAGGTCCAAT TCGTGTGGCC
AAACTCCCTA AAGATGCCAA CAACATGACT GTTGCACAGG CAATGGAGGC TGCAAGTATT
TATACCGATG CTGGTAACCG TAAAGAGGCA AGGCGTTTGA AGGTGCGTAT CCAAGAAAAA
TTCACTTTCC CGATGTCCTG CCTGGTGTTT GGCCTGATTG GCAGCAGCCT TGGTGCAAAA
CCTAATTCTC GGACCAGCCG GACCCAGGGA TTTGGCATCA GTCTCTTATT AATTCTTGCT
TATTACACGC TTAGCTTCAG CTTCAGTTCT CTAGGCGTTA CGGGCACTTT GACACCGATG
TTGGCAGCGT GGGTCCCTGT TTTTATCTCT TTGGCAGGTG GCGGTCTGTT GCTGCGTCAG
GCCAGTCGTT GA
 
Protein sequence
MSKWRLGKLK DVDSWPWKWW RWASSKWRLI PLLDRWLLGE LIPVLLFAIA AFTVVSVSVG 
VMFDLVRKIV EFGLPLHFAL QVFSLKLPFF LVISFPMATL LATLLAYSRL SSNSEFTALR
SLGVSTRRIV APALALALLM TGLTFIFNDV IVPRTNSSAE VTIKRALGKA IATEKGKHVV
YSRFGTITGT KAEDRNQGLS QLFYAREFLK GEMEDVTVLD LSRLGFTQML KAERAIWNEQ
EAMWEFFNGN ILTLTPSGST TSVEFDRYLY PLTSGPIRVA KLPKDANNMT VAQAMEAASI
YTDAGNRKEA RRLKVRIQEK FTFPMSCLVF GLIGSSLGAK PNSRTSRTQG FGISLLLILA
YYTLSFSFSS LGVTGTLTPM LAAWVPVFIS LAGGGLLLRQ ASR
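
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of how the two sequences relate and how a GC fraction can be computed. It is not the database's own pipeline. The helper names (clean, gc_fraction, translate) and the ALT_STARTS set are illustrative assumptions, and ALT_STARTS lists only three common bacterial start codons rather than every initiator in table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small, non-exhaustive set of start codons read as M
# when they occupy the first position of the CDS.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS from its first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # the initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "TTGAGCAAAT GGCGCCTTGG AAAGTTGAAA"
print(translate(first_30))  # prints: MSKWRLGKLK
```

Passing the full gene sequence to translate and gc_fraction gives values that can be compared with the listed protein sequence and the reported GC content.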