Gene P9303_02101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_02101 
Symbol 
ID4777472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp227187 
End bp228323 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID640085709 
Producthypothetical protein 
Protein accessionYP_001016230 
Protein GI124021923 
COG category[S] Function unknown 
COG ID[COG3146] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATCACT GGGAAAACCT GGTTGGTGAG CAGGCGATTC CTTTCTTTCG CTGGCGGTGG 
CTTGCTGCTC TTGAAGACTC AAGCAGCATT TCCGCCAAAC ATGGTTGGCA GCCACTGCAC
TTAGCCCTGT GGCGAGACGA CACGCCTGTA GCGGTTGCCC CTCTTTATCT CAAGGGGCAT
AGCTATGGCG AATTTGTTTT TGATCAGGCC TTTGCTCGCC TGGCAGGTGA TCTTGGTTTG
GGGTATTACC CAAAACTGCT GGGGATGAGC CCTGTGAGCC CTGTGCAGGG CTATCGCTTC
TATGTGGCGC CAGGGGAGGA CGAGGCAGAG ATGACAGTTT TGATGCTGGA AACCATCGAT
GCTTTTGCGC GTCGCAACCA GATTCTCAGC TGTAACTTTC TTTATGTTGA TCCGCATTGG
CGGCCTTTGG CGGAAGCTGC GGGCTGTGCC ACTTGGTTGA ACCAGCAGAG CCTTTGGTCA
GCAGATGGGC AGTCTGATTT CTCTGCCTAT CTCAATAGCT TCAATGCCAA TCAGCGACGC
AATATCAAGC GTGAACGCAA GGCCGTCCAG CAGGCGGGGC TCACGGTTTC AGCGTTGACA
GGAGCAGAAC TTGATGTGCA GCTGTTGAGG TGCATGTATG GCTTTTATGA GCAGCATTGC
GCTCGTTGGG GACCTTGGGG AAGCAAGTAT CTCTCTGAAG CGTTTTTTGA GGCCTTGGCA
GATTCGTCTC TCAGAGATCA GGTGGTGTTG TTTAGTGCCC ATCGTGAGAG TCCTAGAGAG
CCTGTAGCGA TGTCTCTTTG TATACAGGAT GGACAAATGT TGTGGGGGCG TTATTGGGGT
AGCAAGGAGG AGATCGATTG CCTTCATTTC GAGGTTTGTT ATTACGCGCC GATTGCCTGG
GCGTTGGAAC ATGGTTTAGA GCATTTTGAT CCTGGCGCAG GCGGTCAACA CAAGCGCCGT
AGGGGCTTTG TGGCGAAGCC CCATGCCAGC TTGCATCGTT GGTATGAACC GCGTATGGAT
GCTTTGATCC GTGGATGGTT GAGGAAGGTC AATCCTCTAA TGCTCGAGGA GATTGAGTCG
GTGAATGCTG ATTTGCCGTT TCGGGTTGAG CCTGCCCCTC AGTTAATTGT GGAATAA
 
Protein sequence
MHHWENLVGE QAIPFFRWRW LAALEDSSSI SAKHGWQPLH LALWRDDTPV AVAPLYLKGH 
SYGEFVFDQA FARLAGDLGL GYYPKLLGMS PVSPVQGYRF YVAPGEDEAE MTVLMLETID
AFARRNQILS CNFLYVDPHW RPLAEAAGCA TWLNQQSLWS ADGQSDFSAY LNSFNANQRR
NIKRERKAVQ QAGLTVSALT GAELDVQLLR CMYGFYEQHC ARWGPWGSKY LSEAFFEALA
DSSLRDQVVL FSAHRESPRE PVAMSLCIQD GQMLWGRYWG SKEEIDCLHF EVCYYAPIAW
ALEHGLEHFD PGAGGQHKRR RGFVAKPHAS LHRWYEPRMD ALIRGWLRKV NPLMLEEIES
VNADLPFRVE PAPQLIVE