Gene P9303_03251 details

Gene Information

Locus tag: P9303_03251
Symbol:
ID: 4777989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 335857
End bp: 336966
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 56%
IMG OID: 640085828
Product: hypothetical protein
Protein accession: YP_001016343
Protein GI: 124022036
COG category: [S] Function unknown
COG ID: [COG1873] Uncharacterized conserved protein
TIGRFAM ID:
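
The listed lengths follow from the coordinates. A minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that agrees with the listed Gene Length) and that the final codon is a stop codon that is not counted in the protein. The function name cds_lengths is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is part of the gene but not of the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(335857, 336966))  # (1110, 369)
```
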


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACGACAA ACCCATCCCC GAACGATCCA CTGGACAAGG TTCCTAGCGA CCGCCTCTGG 
CTGCGCTCCG AGCTCATGGG GACCCACGTG ATCACCCGTG ATACCGGACG TCGTCTCGGC
GTCGTGGGAG AAGTCGTGGT CGACATTGAT CGCCGCGAAG TGGTGGCCCT AGGCCTGCGA
GACAATCCCC TCACGCGTTA CCTGCCCGGC CTCCCACGTT GGATGCCACT TGATCGGATC
CGTCAGGTGG GGGACGTCAT CCTCGTGGAT TCCTCTGACT CACTCAAAGA GGGTTTCACC
CCAGATCGCT ACAGCCGAGT GATCAACTGC CAAGTGATCA CAGAATCTGG CCAGCAACTA
GGCAGAGTCC TGGGCTTCTC CTTCGACATC GAAACAGGGG AACTCACAAC CCTGGTGATT
GGAGCAATCG GTGTGCCCTT ATTGGGCGAA GGGGTCTTGA GTACCTGGGA GATGCCTGTA
GACGAGGTGG TCAGCAGCGG CGCAGACAGG ATCATTGTGT ATGAAGGAGC AGAAGAGAAG
CTCAAACAAC TGAATAGCGG CTTCCTCGAA AAACTCGGAG TCGGCGGCCC CAGCTGGGAA
GAACAGGAGC GAGAGCGCTA CAGGATGAAT CTTGTGCCAG TGGAAAACCA GCTCAATTCA
GGACAGCCAA CTGAACAGGA GCAGCGCCGG CTCCAACCTT CCACCACTCA AACCTTTGAG
CCGGAAGAGG AACTTGAATA CGTTGAACTG GAAGAGCGTC AACAGGAAGT CATCCCCCAA
CAGCGCTATC TCGACGAAAC ACCCTCAAGC TCCCCAACGC GCTACCGCAA TGACAGAGAA
GAAAGAATGA CCTTCGAAGA ACCTCCTGCC TATGAACAAA GGCCAGTCTT CGAAGAATCA
GCTGCCTATG AACAAAGACG AACCTTTGAA GATCAACAAC CCCAAAGACC AAGGCCAGCT
TCACGTCGAC CTGTTCAGAG CCTTGGTGAT CCTCTTGATG TGGAGCCCCT CGACTTTTCA
GGACGTGATC AAGCTGGCCG AGACCGAGAT GCAGAGGTGG AGGAGCCCCC ACCGCGCCGT
AATGGCACCG AACTGGACGA CCCTTGGTGA
 
Protein sequence
MTTNPSPNDP LDKVPSDRLW LRSELMGTHV ITRDTGRRLG VVGEVVVDID RREVVALGLR 
DNPLTRYLPG LPRWMPLDRI RQVGDVILVD SSDSLKEGFT PDRYSRVINC QVITESGQQL
GRVLGFSFDI ETGELTTLVI GAIGVPLLGE GVLSTWEMPV DEVVSSGADR IIVYEGAEEK
LKQLNSGFLE KLGVGGPSWE EQERERYRMN LVPVENQLNS GQPTEQEQRR LQPSTTQTFE
PEEELEYVEL EERQQEVIPQ QRYLDETPSS SPTRYRNDRE ERMTFEEPPA YEQRPVFEES
AAYEQRRTFE DQQPQRPRPA SRRPVQSLGD PLDVEPLDFS GRDQAGRDRD AEVEEPPPRR
NGTELDDPW
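
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of several alternative initiation codons and is read as methionine at the start position. A minimal sketch of that rule, not the database's own pipeline: the names translate and START_CODONS are hypothetical, and the codon table is rebuilt from the standard amino-acid ordering rather than taken from a library.

```python
from itertools import product

# Codon assignments for translation table 11 (identical to the standard code),
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of table 11; each is read as Met when it starts a CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGACGACAA ACCCATCCCC GAACGATCCA"))  # MTTNPSPNDP
# Last 30 bases, which are not at the start of the CDS.
print(translate("AATGGCACCG AACTGGACGA CCCTTGGTGA", first_is_start=False))  # NGTELDDPW
```

The two fragments reproduce the first ten and last nine residues of the protein sequence listed above; the final TGA codon is the stop and contributes no residue.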