Gene P9303_18531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18531 
Symbol 
ID4775994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1612888 
End bp1614489 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content52% 
IMG OID640087362 
Productpermease 
Protein accessionYP_001017860 
Protein GI124023553 
COG category[R] General function prediction only 
COG ID[COG2252] Permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.632317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCAG ATCTCAAGCA GAAGCACTGG AGGCCGCAAT GGTTCGTCCG TGGAGATGTC 
GACGGATTCC TAGGACTTGG CCTCGATAAC TTGATTCAAG TCCTCTTGAT AATTGCCTTA
TGCCGCAACG TGCTCGGCTA TCCGAATGAA TTAGTTTTCG GCACAATCCT TCCTGCCACA
GGAATAAGCC TGCTGATAGG CAACCTTGCC TACGCGCATC AGGCTCATCA ACTTGCCAGC
ATCGAAAAAC GTAGTGATCG GACAGCTCTG CCCTACGGAA TTAACACCGT GAGCCTGTTT
GCTTATGTGT TTTTGGTAAT GCTCCCTGTG AAACTCACCG CACTGGGCCA GGGAATGGAT
GAGGCAAGCG CTGTTCGCCT CTCCTGGCAA GCAGGGATGG TGGCTTGCCT GGGCTCTGGG
CTGATCGAAA CAGTAGGGGC TTTTACAGCA GGGGTACTGC GTCGCTGGCT GCCTAGGGCT
GCATTACTTG CCACTCTCGC CGGAATTGCT CTTGGCTACA TCGCCCTTGG TTTTCTGCTG
CGTACCTATG CCCAACCTGT AGTGGGGCTG ACAGTACTAG CAATTGTTCT GGTCACCTAT
TACGGACGCT TGCGATTGCC CATTCCTGGT GGCTTGCTTG CTGTTGTTAT TGGCGTGGGG
CTTGCTTGGA GTACTGGATT AATCGATAGC GACGCTAGCC GTTGGAGTCA GGAAGCAAGT
CAGATTGGCT TGCGCCTTCC TCACCTAGAA ATCGCCAATC TTTGGCATGC TCGTGGCCAG
CTGCTGCCCT GGCTAGGGGT GATTGTGCCA ATGGGCCTAT TCAACGTGCT GGGTTCTCTT
CAAAACATCG AAAGCGCAGA GGCGGCAGGT GATCGTTATT GCGTTCGAAG CTCTTTATTG
ATCGATGGCA TCGGCACAAT GGCTGCCGCA GGTCTGGGGT CTTGTTTTCC TACAACCATT
TATATCGGCC ATCCAGCCTG GAAGGAAATG GGGGCTCGCA TTGGCTATTC CTGGCTAAAT
GGCTTGGTAA TGGGTAGCGC TTGTCTTCTA GGTGTTTTTG GTTTGGTCGC AGAGTTGGTA
CCTATTGAGG CAGGCATGGC GATCGTGCTC TATATCGGCA TCGTGATTGC TGCTCAAGCC
TTTCAAGCCA CGCCCAGCAC CCATGCACCG GCAGTAGCTC TTGGCCTGCT TCCTGGATTA
GCAGGATGGG GTGCATCACT GATCAAGGCT GGTTTGCGCG CAGGAGGAGC CGGAAGCAAT
TCAGAACCTT TTAATTCAGC TTTATTAGGA AGGCTTCAAG AAGCTGATGT ATGGGCAAGT
GGCGCTTTTG CCCTTGAGCA AGGACAAATT ATTACCGCCA TGCTGCTAGC GGCAATGTTG
GTTTATGTGA TCGAACAACG CTTTTTAGCA GCAAGCATTA CTTCTTTCCT CGCATCTGCA
GCCTCATGGA TTGGGATCAT CCACGCCTGG CGATTCACCC AGACCGATAC TGTTCTGGAG
CTTGGCTGGG GCGTTGGTAG CTCCTGGGCA ATGGGCTATC TACTAATGGC AGTTGTGTTC
ATGCTCGCCG GTTTGCATCA GCGCAGATTG ATTTCTGGTT GA
 
Protein sequence
MHSDLKQKHW RPQWFVRGDV DGFLGLGLDN LIQVLLIIAL CRNVLGYPNE LVFGTILPAT 
GISLLIGNLA YAHQAHQLAS IEKRSDRTAL PYGINTVSLF AYVFLVMLPV KLTALGQGMD
EASAVRLSWQ AGMVACLGSG LIETVGAFTA GVLRRWLPRA ALLATLAGIA LGYIALGFLL
RTYAQPVVGL TVLAIVLVTY YGRLRLPIPG GLLAVVIGVG LAWSTGLIDS DASRWSQEAS
QIGLRLPHLE IANLWHARGQ LLPWLGVIVP MGLFNVLGSL QNIESAEAAG DRYCVRSSLL
IDGIGTMAAA GLGSCFPTTI YIGHPAWKEM GARIGYSWLN GLVMGSACLL GVFGLVAELV
PIEAGMAIVL YIGIVIAAQA FQATPSTHAP AVALGLLPGL AGWGASLIKA GLRAGGAGSN
SEPFNSALLG RLQEADVWAS GAFALEQGQI ITAMLLAAML VYVIEQRFLA ASITSFLASA
ASWIGIIHAW RFTQTDTVLE LGWGVGSSWA MGYLLMAVVF MLAGLHQRRL ISG