Gene P9303_11021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_11021 
Symbol 
ID4777670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp984513 
End bp985982 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content53% 
IMG OID640086611 
Producthypothetical protein 
Protein accessionYP_001017116 
Protein GI124022809 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.879161 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAGC AAAGCTTGCC TGCGATCGCT GGCTACGAAG CCGAAATTAT TGCCCTGGTT 
CAGCAAGCTA ATCCCGCTGT GCTCACGCAT AGCAATCGCA AGCTGGAGCA GTTTCAGTCA
GCCTTTGCCT GTGCTCTCCA CATGCATCAG CCCACGATTC CAGCAGGTGC GAATGGAGAA
CTGATCTCGC ATCTGCAATA CATGCTGGAG CACAGCGAAG AAGGCGATAA CCACAACGCC
GAACCCTTTG CCCATTGCTA CAAGCGCCTC GCAGACATCA TTCCCCAACT GATTCAAGAG
GGATGCAACC CTCGCATCAT GCTGGATTAC TCAGGCAACT TGCTCTGGGG TGTTGAGCAA
ATGGACCGTG TCGACATTCT CGAGGCGCTT AAACGTCTGG CATGCGATCC CACGCTTCAA
CCCCATGTGG AATGGCTCGG CACCTTTTGG AGCCATGCCG TTGCACCCTC AACTCCTATT
CCAGATCTAA AGCTACAAAT CCTGGCCTGG CAGCATCAAT TCGCTGCCAT GTTTGGGCGG
CAGGCATTGC AACGAGTGAA GGGTTTCTCG CCACCGGAAA TGCACCTTCC CAACCATCCA
GACACCCTCT ATGAATTGGT GAAAGCCCTC AGAGATTGCG GATACCGTTG GCTCCTTGTT
CAAGAAAACA GCGTTGAAAA CTTCGATGGC TCATGCCTTC GCCATGCACA GAAATACGGC
CCCAATCAAC TTGTGGCTCG TAACTCCAGA GGGGAAACAG TCAGCATTGT GGCGTTGATC
AAAACCCAGG GCTCAGACAC CAAATTGGTA GGGCAGATGC AGCCCTATCA CGAAGCATTA
GGCCTGGGCA GACAATCACT GGCAGGCAAA TCGATTCCAT CATTGGTCTC TCAAATTGCC
GACGGAGAAA ATGGAGGCGT AATGATGAAT GAGTTTCCAG CCGCCTTTAT CCAAGCCCAT
CAAACCATTG CTTCCCAGGT TGATCCTGTA AGCACAGTTG CGCTCAACGG CACTGAATAT
CTGGAGTTAC TGGAGGCCGC CGGTGTGGAA GCCTCTGATT ACCCAAAGAT TCAGGCGATA
CAGCAACACA AGCTGTGGCA TAACACTGAC AGCCCCATCA ACCCCGAATC AATCGAAGCG
GCCATCAGCG ACCTCAAAGA AACAGATCCC TCCTTCTCAA TGGACGGCGC ATCCTGGACC
AACAATCTCA GCTGGATCAA GGGCTATGAA AATGTGCTGG AACCGATCAA CAGCCTCAGC
GCAAAGTTTC ACCAGCTGTT TGATCCCTTG GTGACAAAGG ATCCAGCGAT CACGCAAACC
CCGCACTATC AAGAAGCTCT GCTCTACCTG CTAATGCTAG AAACCAGTTG CTTCCGCTAT
TGGGGGCAAG GCACCTGGAC CAACTACGCC AATGAGATCC ACCGGCGCGG TGAAGCCATG
GTCGAAGCAG CAAACCAGGC ACTGAGATAG
 
Protein sequence
MPKQSLPAIA GYEAEIIALV QQANPAVLTH SNRKLEQFQS AFACALHMHQ PTIPAGANGE 
LISHLQYMLE HSEEGDNHNA EPFAHCYKRL ADIIPQLIQE GCNPRIMLDY SGNLLWGVEQ
MDRVDILEAL KRLACDPTLQ PHVEWLGTFW SHAVAPSTPI PDLKLQILAW QHQFAAMFGR
QALQRVKGFS PPEMHLPNHP DTLYELVKAL RDCGYRWLLV QENSVENFDG SCLRHAQKYG
PNQLVARNSR GETVSIVALI KTQGSDTKLV GQMQPYHEAL GLGRQSLAGK SIPSLVSQIA
DGENGGVMMN EFPAAFIQAH QTIASQVDPV STVALNGTEY LELLEAAGVE ASDYPKIQAI
QQHKLWHNTD SPINPESIEA AISDLKETDP SFSMDGASWT NNLSWIKGYE NVLEPINSLS
AKFHQLFDPL VTKDPAITQT PHYQEALLYL LMLETSCFRY WGQGTWTNYA NEIHRRGEAM
VEAANQALR