Gene P9303_16391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_16391 
Symbol 
ID4777743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1431872 
End bp1432954 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content54% 
IMG OID640087148 
Productpermease 
Protein accessionYP_001017648 
Protein GI124023341 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.216071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAATTCT CCCAATGGCT TGCTCTGGCT GCACTGATGG CTGCGGGTGT GCTGTTTTGG 
AGCCTTAGAG AGGTGCTGAT CCACCTCTTT GCAGGCGTTG TTCTTGCCAT GGCTCTCTGC
ACTCTGGTGG GTGAGCTGCG CTCAAGGAAA CCCATGCCTC GCAGCATGGC CCTCCTGATC
TGCTTAGTCG CGCTTGTGCT TGTTGTGAGT ATGGCGACCG CCATTGTGGT ACCCCCCTTT
ACAGAGCAAT TCCACCAATT GCTTTTACAA CTACCTTCAG CAGCTAAGGA ACTCTGGAAG
CTGGCTATCG GCGCCATCAA CCAAACCTCT GCAATGGTCT ACGGAGTTAA CAACAGCAAG
GGTGGCTGGG AAGAACAGCT GTTTGCAAAT GGGTTAAACG CATTACCAGA TGGTGCAAGC
TTGGCCTCAG GAGTGAGGGA AGGCCTGCAA GGCCTTCTGG GCCTTGCAGG CAACCTCGGC
AGCGGCTTGG TGCAACTCCT GTTTGTGTTG GCGATGAGCT TGATGGTGGC GGTGCAACCG
ACGGCATACA GAGATGTAGC CATCTCGTTA TTGCCTTCGT TTTATAGGCG ACGAGCCCGC
TCGATCCTTA GCCAATGCGG AGATGCGCTG AGCAGCTGGA TGGTGGGTGT ACTGATCAGT
TCCTTTTGCG TGGCCCTGCT TGCAGCAATC GGCCTTTCAC TACTTGGGAT CAAGCTGGTG
ATGGCGAATG CCTTGCTTGC TGGGATGCTC AACGTGATCC CAAATGTGGG GCCTACCTTG
AGCACGATCT TTCCGATGTC GGTAGCGCTA CTTGATGCAC CCTGGAAAGC AGTCGCTGTG
CTTGGGCTGT ATGTGGTGAT TCAGAACCTG GAAAGTTATG TGATCACTCC ATCAGTGATG
CATCACCAAG TGAAATTGTT GCCTGGGCTG ACTCTCACAG CCCAGTTTGT ATTCACAGTT
GTGTTTGGTC CTCTCGGCCT GCTGTTGGCG CTGCCACTCG CTGTAGTGCT GCAAGTGTTG
ATTCGCGAGG TGGTCATCCA CGATTTGCTT GATCCCTGGA AGAAAAAGCG ATTGGCGCCA
TGA
 
Protein sequence
MKFSQWLALA ALMAAGVLFW SLREVLIHLF AGVVLAMALC TLVGELRSRK PMPRSMALLI 
CLVALVLVVS MATAIVVPPF TEQFHQLLLQ LPSAAKELWK LAIGAINQTS AMVYGVNNSK
GGWEEQLFAN GLNALPDGAS LASGVREGLQ GLLGLAGNLG SGLVQLLFVL AMSLMVAVQP
TAYRDVAISL LPSFYRRRAR SILSQCGDAL SSWMVGVLIS SFCVALLAAI GLSLLGIKLV
MANALLAGML NVIPNVGPTL STIFPMSVAL LDAPWKAVAV LGLYVVIQNL ESYVITPSVM
HHQVKLLPGL TLTAQFVFTV VFGPLGLLLA LPLAVVLQVL IREVVIHDLL DPWKKKRLAP