Gene P9303_01351 details

Gene Information

Locus tag: P9303_01351
Symbol:
ID: 4776396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 148158
End bp: 150467
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 49%
IMG OID: 640085634
Product: hypothetical protein
Protein accession: YP_001016155
Protein GI: 124021848
COG category: [S] Function unknown
COG ID: [COG3551] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
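
The coordinate and length fields above agree with one another. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 148158
end_bp = 150467

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2310
print(protein_length_aa)  # 769
```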


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTCAA GTGGAGCTTC CCTTTTAGCT GAACAGTTGC AAAATCTAGG AATTTTTCTA 
CCTGGCCAGA TGATTGCAGC TGATAATGAT AATCCTCAGG GTTACTTTGA ATGGGATGAG
GTAGTTGAAC TGCAAGAGAA TTTGTTGATT GCTCTAGATC GATGGTGGCC CTCTCATACT
GGTTCGTTTT GTTTACCATC AAATTGGTTG ATTCACCCCG CTACTCATAA ATTTCGTGCT
CAGCTTACTA ACTTGCTTCG TCATCAAATT TCTACTAGTC AAGCGCTATT CTTAATCAAT
GACCCTCGTA GCAGTATTTT ACTGCCTCTC TGGCGTGATA TTTGCAACCA ACTTGATATC
TCTCTTCGTT TGATTCTGGC ATTTCGCAAA CCGGATGATG TGGTTTCTTC GATCATGTCA
TGTAACGAGC GTCTGGCAGG TATGACATAT TGGAGAGCTC AACAGCTTTG GTGGAGGTTT
AATTCATCCG TACTTGCCTC CGTGCCTTCC AAGTGCGAAG CAGAGCTACT AGTTGTTCAT
TACGACACTT GGTTTGATGA TCCTATAGGT CAGGCACTTT TCTTAGCATC TCATCTTGCT
TTAGAAAAGC CGAATTCAAA TCAGCTTAAT TCTGTCCAAA AAGCAATCTT CTTTCAACAG
CAGAAAGCTA AGTCTCTTCC AGATTATGCA CCGCCTTTGG ATGATCGAAT TAGCAACCTT
TATCATTGGC TGTCTAAGCA AAAGACTGTT CATTTGCCTC TTAAACTGTT TAAAGGCTCT
TTGCAACCAA GGCGCACTTT TCGGCACAAA ATCTTCCATA GAATTGACTG GTTATGGCTT
ATCCGTTCTT CTTTATTGCC GAAGGGCGGA TTGTTTGCCT ATAGAAAAAA CTTTTTGCAG
GGTGTTGGTG CAGGACCTTT AGCTTTACCT GTTTGGATTG CGCGGCAAAG GCCAAGCTTG
TTGCGCTATC ACCGCGATCC TTTGGCTTGG TATCAACGCG TTGGTTGGCG TTTGGGTGTG
AACCCTCATC CATTACTGGA GTCAGCTCGG CTCTGGTCGC ATCTGGGATT CCAGAAAGAG
GCGGTTGCCC TTTATCGGCG AGAGGCCATG TTTGAAAACA TTCCGGTCCA TCCCCGTTTT
GACTCTGTGT ATTACAGGCA GCAATGTCGG AATGCCTATT GCATTCCTCA ACCCACGCCC
TTGGAGCATT ATTTGGTCGA GGGTTGGCAG CAAGGCCTGG CCCCGCATCC AGCTGTCGAT
CCACTCTGGA TGAAGAGGCG GCATGGCTTG CCTGGTGAAC CACTGGTGGC CTTGATCCTC
GATGGGGGAG ATCCCACTGA CCCCGGCCTG ACTCATCCCT GCGGCAATCT TTATGGTGCG
GCCCTAGCCG AGCCACTGTG CTCCACTCGC CTGCCCGTTG CCCTTGTTGA TCTGCTGCGA
CTTTGGAACC AGCGAGGACT ATGGCCAGCA GAACGTTGGC TTGATCATGA GTGCATGCAA
GATCCTTTGC CGAGTTTCAA TCTTTTTGAT ACTGAGCAGG CATCTTTGTT TGCCTTGGGC
TTGCAAGTTC AATTGACTTC TGCTTCACGT CTGCAGATGC CCCCAACCTT AGGTTTTGGC
CATGATTTGC CTTGGCGTGC AGAGCGGTTG TTGGCTGGTT GCAGTGATCA ACTCGCCATT
ACAAAATCCG CCTCGGTAAG GCTGCATGTG TTGGAGGATG CTGATGATTG TCTGCGCTGG
CAACAGACGA GTTCCCCTGG AGATTGGCTG ATCAATTTCC ACTGGCCTCC TACTAAAAGT
CTTGCTAGTT GGATCCAGGG TCTACGCGGC ATGGAAGCAG TGCTGGATCC TGATCCTCAG
CGAACAGCTT TTTTGCAGTT GTTTGGCGTT AAGGCTGTTC ATCAGCCTTT TCAACCTTTG
GAGTTTGCAG CTGGTAGTGA TGAGGATCTC TTGCGTTTGG CACAGTTGAA ACTTGGTCTG
CCAGATCCCC GTTGGTTTGA GCCTTCCCTT GAGCTCGCTG TTATTGGTAG CAGTGGGCCG
ACCCAGGAAC GGCGCTGGGG AGAGTTGGGC CTGAAGTTAG AAGCTGCTGG ATTGCTGCTG
TTGCCGCGTT TGCCTCAGAT TGAAATTGCC AACCTTGATC AGCTCAAGGC GTTGCAGGCT
TGGCTCAATC AATTGGCTCA GAATTGTAAG AGGGTTCTCT GGCTTGAGCC AATACAGCAA
GGTGCTTGTC AGTTGTCATC TGAAGCCGTT GTGTTGGCTC CAGAAGTAGA GCTGGATTTG
CTTCTGCAAT GGGAATCTCG TTGCCGCTGA
 
Protein sequence
MHSSGASLLA EQLQNLGIFL PGQMIAADND NPQGYFEWDE VVELQENLLI ALDRWWPSHT 
GSFCLPSNWL IHPATHKFRA QLTNLLRHQI STSQALFLIN DPRSSILLPL WRDICNQLDI
SLRLILAFRK PDDVVSSIMS CNERLAGMTY WRAQQLWWRF NSSVLASVPS KCEAELLVVH
YDTWFDDPIG QALFLASHLA LEKPNSNQLN SVQKAIFFQQ QKAKSLPDYA PPLDDRISNL
YHWLSKQKTV HLPLKLFKGS LQPRRTFRHK IFHRIDWLWL IRSSLLPKGG LFAYRKNFLQ
GVGAGPLALP VWIARQRPSL LRYHRDPLAW YQRVGWRLGV NPHPLLESAR LWSHLGFQKE
AVALYRREAM FENIPVHPRF DSVYYRQQCR NAYCIPQPTP LEHYLVEGWQ QGLAPHPAVD
PLWMKRRHGL PGEPLVALIL DGGDPTDPGL THPCGNLYGA ALAEPLCSTR LPVALVDLLR
LWNQRGLWPA ERWLDHECMQ DPLPSFNLFD TEQASLFALG LQVQLTSASR LQMPPTLGFG
HDLPWRAERL LAGCSDQLAI TKSASVRLHV LEDADDCLRW QQTSSPGDWL INFHWPPTKS
LASWIQGLRG MEAVLDPDPQ RTAFLQLFGV KAVHQPFQPL EFAAGSDEDL LRLAQLKLGL
PDPRWFEPSL ELAVIGSSGP TQERRWGELG LKLEAAGLLL LPRLPQIEIA NLDQLKALQA
WLNQLAQNCK RVLWLEPIQQ GACQLSSEAV VLAPEVELDL LLQWESRCR
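
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be removed before they can be compared. The following is a minimal sketch of how the protein sequence could be re-derived from the gene sequence and how a GC fraction could be computed. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ in their permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering (first base
# varies slowest, third base fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGCATTCAA GTGGAGCTTC CCTTTTAGCT GAACAGTTGC AAAATCTAGG AATTTTTCTA"
last_line = "CTTCTGCAAT GGGAATCTCG TTGCCGCTGA"

print(translate(first_line))  # MHSSGASLLAEQLQNLGIFL
print(translate(last_line))   # LLQWESRCR
```

Both fragments reproduce the corresponding start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value that can be compared with the GC content field.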