Gene P9303_07651 details


Gene Information

Locus tag: P9303_07651
Symbol:
ID: 4776956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 704071
End bp: 705234
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 54%
IMG OID: 640086274
Product: Phage integrase family
Protein accession: YP_001016781
Protein GI: 124022474
COG category:
COG ID:
TIGRFAM ID:
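The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1164 bp) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(704071, 705234)
print(length, protein_length_aa(length))  # 1164 387
```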


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.131913
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCTGA GCAACGAGCT AATCAACATC AACCGTGCCC TGGCTGACAG CGGCATCAAC 
CTGAGAATTG AACAGCGAGG CCAGTGGCTC AATTTACGCG GAGCACTGCC CTGCCGGAAT
GGAACTGGAT TGATCAAAAC TCAACGAATC AGTTTGCAGC TTTTGGCAGA ACAAAAAGGA
TTGAAAGAGG CTGAGCGAAT TGTGCAACTG GTGCACTACC AACTGCAACG CAAACAATTC
GACTGGTCCC AGTGGACGAC CAAATCGACA CGGAAACAAC CTGAACAGAT AGCGACTGGG
CTCAGAGAAG CTTTGGTCAG CTTTGAAGAA GCATTCTTTA CTGATCCATA TCGCCGACGG
TCACCAGCCG GTAGCCGCAG CACATGGACG TCCGCTTACC TTCCTTATTT ACGACGACTC
AAAGCCCTAG CTGTTAACAA GCAGAGCTGT TTTGATTCAA ACCTTTTAAG AGACACTCTG
GCCAGTTATG CAGATGGCAG CCGAAGCCGA CAGCAATGCG CCACGGCCCT AGGTGCATTG
GCACGCCACC TGGAAATGGC GCTGCCGGAA GACTGGCGAG CAGAAGCAGA TGGATATGGA
CTACATCAGG CGCGCTTTCG TCAACTACCC AGCGACAAGC AGATCATCGA GGCGGTGGAG
CGCATCCCCA ACCCAGGATG GCGACTTGCC TATGGACTGA TGGCCACTTA CGGCCTGCGC
AATCACGAGG TGTTCTTCTG CGACCTTGCT GCTTTAGCGA AGGGGGAAGA TCAGGTGCTG
CGGGTCCTAC CAAACACAAA AACCGGCGAG CATCAGGTTT GGCCGTTTCA TCCAGACTGG
GTCGAGCATT TTGAACTTGA ACAACTAGCA AACAATGCCC AGGCCCTGCC GCCGGTGAAT
GTCGACCTGC GTCACACCAC ACTGCAACAG GTGGGGAGAA GAGTGTCGGA ACAATTCCGA
CGCTATCAAC TGCCCCTCAC CCCCTACAAC CTGCGGCATG CCTGGGCGGT ACGCACAATC
CACATCGGCC TTCCAGACAC CGTTGCAGCA AGAATGATGG GCCATTCAGT GGCTATTCAT
ACCCGCACCT ATCACCACTG GATCACCCGA CGTGACCAAC AACAAGCGGT AGATGCAGCC
CTAGCTCGAA AGCTCAGCCC ATGA
 
Protein sequence
MELSNELINI NRALADSGIN LRIEQRGQWL NLRGALPCRN GTGLIKTQRI SLQLLAEQKG 
LKEAERIVQL VHYQLQRKQF DWSQWTTKST RKQPEQIATG LREALVSFEE AFFTDPYRRR
SPAGSRSTWT SAYLPYLRRL KALAVNKQSC FDSNLLRDTL ASYADGSRSR QQCATALGAL
ARHLEMALPE DWRAEADGYG LHQARFRQLP SDKQIIEAVE RIPNPGWRLA YGLMATYGLR
NHEVFFCDLA ALAKGEDQVL RVLPNTKTGE HQVWPFHPDW VEHFELEQLA NNAQALPPVN
VDLRHTTLQQ VGRRVSEQFR RYQLPLTPYN LRHAWAVRTI HIGLPDTVAA RMMGHSVAIH
TRTYHHWITR RDQQQAVDAA LARKLSP
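As a rough consistency check between the two sequences above, the gene sequence can be translated and compared with the protein sequence. This is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and it assumes the input contains only A, C, G and T. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGAGCTGA GCAACGAGCT AATCAACATC"
print(translate(first_block))  # MELSNELINI
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the listed protein sequence and the reported GC content.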