Gene P9301_01201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_01201 
Symbol 
ID4911057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp117894 
End bp119048 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content29% 
IMG OID640159685 
Producthypothetical protein 
Protein accessionYP_001090344 
Protein GI126695458 
COG category[S] Function unknown 
COG ID[COG3146] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAAC CAATACATAA AGTTGAAGTC AAATTATCAA TTAAAGAAAT TTCCAAGGAG 
ATATGGAATG AATTAGCCAA TGGAATTAAT AATCCATTTT ATGAATGGAC TTGGATTAAA
AACCTTGAGA TATCAAAAAG TGTTTCAAGA GAAACTGGTT GGCAGCCATT ATATTTTGTT
GCTGTTAAAA ATGAAGAGAT ATTAGGTATT GCACCACTTT TTTTAAAAAA TCATAGCTAT
GGAGAATTCA TTTTTGACCA ATCATTTGCA AGATTGGCTC AAGAGCTGAA TTTAAATTAT
TACCCTAAAT TAATTGGAAT GAGTCCTTAT AGTCCTGTAA ATGGATATCA ATTTCTTTAT
AAAAAAAATA AAGACAAGAA CGAAATTACA AATTTACTTA TAAACCACAT CGAAAGCTTT
GCGATTAAAA ACAAAGTTCT AAGTTGTAAT TTTTTATATA TTGATGAAAG CTGGGGCAAC
CATCTTAAAT CTTTGGGATA CCATGAATGG ATAAATTCCA GCAGTGAATG GAGGAGTAAT
GGAGAAAAAA CATTTAATGA TTTTCTTTCT AGATTTAACT CTAATCAGAG AAAAAATATA
AAAAAAGAGA GGAAATCAAT TACTAAACAA GATATTAAAG TAGAAATTTT TAATGAAGAT
GATATCAACC AAGAAATACT CAAAAAAATG CATAATTTTT ATGAACAGCA TTGCTCGAGG
TGGGGAGTTT GGGGAAGTAA ATATCTAACA TCTACATTTT TCGAAACACT GGTTGATAAT
AAAAAAAATC TTTTACTTTT TAGCGCATCA AAACATGATT CAGATGAAAT TTTTGCTATG
TCGATGTGCG TTAAAAATCA AAACAACTTA TGGGGTAGAT ATTGGGGTAG TCAAAAAGAA
ATATCTAATT TACATTTTGA ATTATGCTAT TACCAGCCAA TTGAATGGGC AATAAAAAAT
GGTATCCATT TGTTTGATCC TGGAGCGGGT GGCAAACATA AGAGACGTAG AGGATTTTTT
GCAAAAAGCA CTATTAGCTT GCATAAGTGG TTTGACAAAA ATATGGAAAA TATAATTAGT
CCTTGGCTAA ATGAAGTGAA TAAACAAACC GAGATGGAAA TTGATTTTGA AAATAAATCT
ATACCCTTTA AATAA
 
Protein sequence
MNQPIHKVEV KLSIKEISKE IWNELANGIN NPFYEWTWIK NLEISKSVSR ETGWQPLYFV 
AVKNEEILGI APLFLKNHSY GEFIFDQSFA RLAQELNLNY YPKLIGMSPY SPVNGYQFLY
KKNKDKNEIT NLLINHIESF AIKNKVLSCN FLYIDESWGN HLKSLGYHEW INSSSEWRSN
GEKTFNDFLS RFNSNQRKNI KKERKSITKQ DIKVEIFNED DINQEILKKM HNFYEQHCSR
WGVWGSKYLT STFFETLVDN KKNLLLFSAS KHDSDEIFAM SMCVKNQNNL WGRYWGSQKE
ISNLHFELCY YQPIEWAIKN GIHLFDPGAG GKHKRRRGFF AKSTISLHKW FDKNMENIIS
PWLNEVNKQT EMEIDFENKS IPFK