Gene P9303_04201 details


Gene Information

Locus tag: P9303_04201
Symbol:
ID: 4778990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 420891
End bp: 422414
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 46%
IMG OID: 640085924
Product: hypothetical protein
Protein accession: YP_001016437
Protein GI: 124022130
COG category: [S] Function unknown
COG ID: [COG5361] Uncharacterized conserved protein
TIGRFAM ID:
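
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from start/end coordinates.

    Assumption: coordinates are 1-based and inclusive.
    """
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS.

    Assumption: the last codon is a stop codon and is not counted.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(420891, 422414)
print(length_bp)                  # 1524
print(protein_length(length_bp))  # 507
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.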


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACAT TAGCGAATGG ATTGTCAAGC CCAATGGGAT TTGCAGAGGC AACTGCATCA 
AAAAAAGTAT CCCAGAATTT CACGACTCCA ATACCCTCAG GAATTCTCAC GCCAGACACC
GTAAAGACGA GTGCTGGAAC ATTCAGATTT TTCGACGGCA TGCCTGACAA AGAGACGGTT
CGGAAAAGTT TTGAAAATCT CAAGTTCATT CGTGGCTATG AGACATTTCT GACCTTGATG
CCAGCAGCAA GCATTGAAAT GCTTCGCCAT GGTCATGCAG AAGTAGGAGT CGATAACCAC
ACGAAGGTGG CATTGATGGC CCCCTTGAAC TCGAACCCTC TGTTCCTGAC AGGAAATACC
GACACCGTCT ACGGATCGAC GTTCTTCAAC CTGAAGCAGA CAGGGCCTCT GGTGATCGAG
ATCCCTGCCG GGCTTGGTCC AGGAACCATT AATGATGCCT TTTTCCGCTT TGTTGCTGAT
ACTGGAGCTC CTGGGCCCGA TAAGGGAAAA GGAGGAAAGT ATCTCATCCT GGGGCCAGAT
GATGCTGAGC CAAATAATGC TGACGGCTAT TTTGTTTTTC GTTCTCCTAC GTATTCAAAT
TGGTTAATTC TTAGAGCTTT TCTTGATTCC AAGGGTAAGC CTGATCAAGC AATCGCTAAT
TATGAAAATG GTTTGCGTCT TTATCCCTAT TCACAAAGGG ACAACCCGGC TCAGATGAGC
TTTATTCAAG TTGGTGAAAA GGTTTTCAAT ACAGTTCATG CCAATAATTT TGAGTTCTTT
AATGAGCTCA ATACCGTCAT TCAGCGTGAG CCCGTTGCTT TCCTTGATCC TGAGCTAAGG
GGATTGGCAT CTGCTATCGG ATTGGAAAAA GGCAAACCTT TTTCTCCGTC TCCAGAAGAC
AAAGAGATCC TAGAGGAAGC AATTCAAGTT GGTGTTGCCT ACGTGCGCTC CGACATGGGT
AAGCCCCGCA ATCAGGATGT TTACTTCTAT CAAGGCAAAA AGTGGTTTAC ACCTTTTGGG
GGTGGAAGTC ATGAATGGCT TGTTGATAAT GGTGCAGGTG GTCGAAATTT GGATGCTCGC
AATAATTTCT TTTGGGGCTA CACAGTAAAT ACACCAGCGA TGGTGTTGCA GATGATTGGG
GCTGGGTCGC AATATGGCGT TGTGGCCACT GATGCTAATA GTCGTTATCT CGATGGAAGC
AAAACTTACA AGTTCACGAT TGATAAGGAT GTGCCTGCGA AAGACTTTTG GTCGATGGTC
GCTTACGACC CGCAGACACG GTCTGAGCTT CAGACTGGCC AGCTCTTGCC AAGCAAAAAC
AGTATCCGAA ATCAGGATTT AGACGTAAAC GCTGATGGCA GTATTGACCT CTATTTCGGC
CCCAAGTCCC CCACGGGAAA AGAAGCCAAT TGGATTGAAA CTGTTCCTGG TAAGGGCTGG
TTTGCTGTGT TTCGTCTCTA CGGACCGCTG CAACCTTGGT TCGACAAAAT TTGGCATCTC
AATGACATTC AACCGATGGA ATAA
 
Protein sequence
MSTLANGLSS PMGFAEATAS KKVSQNFTTP IPSGILTPDT VKTSAGTFRF FDGMPDKETV 
RKSFENLKFI RGYETFLTLM PAASIEMLRH GHAEVGVDNH TKVALMAPLN SNPLFLTGNT
DTVYGSTFFN LKQTGPLVIE IPAGLGPGTI NDAFFRFVAD TGAPGPDKGK GGKYLILGPD
DAEPNNADGY FVFRSPTYSN WLILRAFLDS KGKPDQAIAN YENGLRLYPY SQRDNPAQMS
FIQVGEKVFN TVHANNFEFF NELNTVIQRE PVAFLDPELR GLASAIGLEK GKPFSPSPED
KEILEEAIQV GVAYVRSDMG KPRNQDVYFY QGKKWFTPFG GGSHEWLVDN GAGGRNLDAR
NNFFWGYTVN TPAMVLQMIG AGSQYGVVAT DANSRYLDGS KTYKFTIDKD VPAKDFWSMV
AYDPQTRSEL QTGQLLPSKN SIRNQDLDVN ADGSIDLYFG PKSPTGKEAN WIETVPGKGW
FAVFRLYGPL QPWFDKIWHL NDIQPME
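
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative only and its helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code; table 11 differs in which codons may initiate translation, which this sketch does not model (the gene here starts with ATG, so that does not matter for this record). The sequences are pasted in blocks of ten characters, so whitespace is stripped first.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First seven codons of the gene sequence above.
print(translate("ATGAGCACAT TAGCGAATGG A"))  # MSTLANG
# Last three codons of the gene sequence above, including the stop codon.
print(translate("ATGGAATAA"))                # ME
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared against the record.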