Gene P9303_03471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03471 
Symbol 
ID4778108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp354184 
End bp355248 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content43% 
IMG OID640085850 
Producthypothetical protein 
Protein accessionYP_001016364 
Protein GI124022057 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.604179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTT CATTCTTTCG CTCCTCTTTG GCGCTGATTG TTTCAGCATT CAGCATGACA 
GCTCTGACTT ACCTTGACTC TGAGAAGAAA GCACTGGCTG GTAATAATTG TCCTAAGGCA
GCAATTGTTG CCGAGCTTGC GGAGGTAGAG AAGGCGACTC CCATCACAAA AGAAAACTAT
GCTTTTGCGG AAACCGATAT CATTCTAGCG GAATATGTGA AGAAGATAGC CAAAAATAAC
TGCTCCGAAG GCATAGGAGA ATTCATGCAT ATTAGAGATG CTATTGATAT TAACGATCGT
ACAATCATTC GCCCGAATTT CGATACGCTG TATTCAGCAG CTGTGATTGA CCTCAATAGA
CCGGCAGTCA TTGTGATGCC TGAAACAGAT AGGCTGCAAA TTTTAGCCGC GCTTGATGAG
GAACATTGGA ATGTTCTTCT CGCAGATCAG CCTGGACGCT ACGAATTTAC AAAAGAGGCA
GTGGGAGCTA GATACATTTT TTTGATTGTA CGCACACAGG TCAATATGAA TGACCCAGAT
GACCTTCAGA AGGTTTCTGC TTTGCAAGAT CGAATTCAAA TTCAACAAAC TGATAAAGGA
GAATATCTTC AGACCAAAAG ATGGGATCGT CGTGAGATTC TTGCATTGCG AGATGAGTAC
AACGAACGCT GGAGCTCTGA GGGCATAAAA AGTGAGTTGG TGTTTGGGGG GAAAGGTGAG
ATCTCCCCTG AGATGAGAAA TTTTGGCGTA GCCTTTGGAT GGGGTGGCCT TCCTAAAAAA
GGAGCTGTCT ACCCTTCGCT GCAAGTGCCA GTTTCAACTG GTCCGCTGAC CTTAACTCTT
AAGGATGTAC CAATCGCTGA TAACTCATTT TGGTCAGTCA CTATATACAA TCAGGAAGGC
TTTTCTCGGG GAGAGCATTA TAATATCAAC AGCGCTTTTG CTAAAGCGAA TAAAAATGGA
GAGTACGTTT TAAATTTCGG GACATCATTA GGGCAAGATA ACTATCTTGA GATTTATCCT
GGCTGGAATG CAACACTTAG AATTTACTCT CCTCAGTCTG CGTAA
 
Protein sequence
MDFSFFRSSL ALIVSAFSMT ALTYLDSEKK ALAGNNCPKA AIVAELAEVE KATPITKENY 
AFAETDIILA EYVKKIAKNN CSEGIGEFMH IRDAIDINDR TIIRPNFDTL YSAAVIDLNR
PAVIVMPETD RLQILAALDE EHWNVLLADQ PGRYEFTKEA VGARYIFLIV RTQVNMNDPD
DLQKVSALQD RIQIQQTDKG EYLQTKRWDR REILALRDEY NERWSSEGIK SELVFGGKGE
ISPEMRNFGV AFGWGGLPKK GAVYPSLQVP VSTGPLTLTL KDVPIADNSF WSVTIYNQEG
FSRGEHYNIN SAFAKANKNG EYVLNFGTSL GQDNYLEIYP GWNATLRIYS PQSA