Gene P9303_20571 details


Gene Information

Locus tag: P9303_20571
Symbol:
ID: 4776586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1811084
End bp: 1813426
Gene Length: 2343 bp
Protein Length: 780 aa
Translation table: 11
GC content: 36%
IMG OID: 640087566
Product: SAM-binding motif-containing protein
Protein accession: YP_001018058
Protein GI: 124023751
COG category: [M] Cell wall/membrane/envelope biogenesis
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2230] Cyclopropane fatty acid synthase and related methyltransferases
    [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
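
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1811084, 1813426)
print(length)                           # 2343
print(expected_protein_length(length))  # 780
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.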


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.183834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGAT TTGGAGAAAA GAAGAAAATA AAAAATAAAC AACCCTCCAA GAAGTTTGCC 
CGAATACCAC CTGATCAACT CAAGGCTATC GCTTTTAAAT ATCACCAACA AGGCAATATA
AATGAGGCGC AAAAAGCCTA TCAGGAATTT ATTAATAGTG GTTTAAGTGA TCCAGATGTT
TTCTCTAACT TTGCATTAAT CTGTCAATCC CAAGGAGAGA TTGACAAGGC AATAAAAGTC
TATAAGAAAA GCGTCAAACT ATTTCCTGGC CATGCTTTTT CACATGCAAA CTTGGGATAT
CTTCTCTTTC AGATAGGAAT GTTAGATGAT GCTGAAGTGG CTATCCGCCA GGCGATTGTT
ATACAGCCTA ACCTCGCCAA TGCTTATTCA TACTTGGGTT TAGTTTTACG AGAAAAAGGA
AGACTAACTG ATGCAGAAGA TATAACCCGA AAGGCCATTG AACTTCAACC AGATTTGGCT
GATGCATACG TAAACCTTGG TCAGATCCTA CAGAATCAAG GCAAACTAGA TGAAGCAGAA
CATACAACAC GCAAGGCAAT TGAATTACAA GATGATTCGG CAAGTATATA TCTCAATCTA
GGTGGTATCC TGCAAGACCA AGGTAATCTA ACTGATGCAG AAGCGAACAC AAGAAAGGCA
ATGAACTTAC AGGCCGATCT GCCTGATGTC AATTTAAACC TTTCTATAAT TTTAAAGGAT
CTTGGCAGAT TAGAGGAGGC TGTATTTCAT TTAACAAGAG AAATTGAGCT TTACCCACAA
AACCAATCAT CATATTTGCT TCTTAACTCA CTGCTTGAGG AATCTGATCT TTCATTCTTA
CCTGAAAGAC AGTCTAGAAT TTTGCTTCGT GGCCTATTAA AGAGAAATGA TATTGCACAT
AAAAATCTAT TTTCAGCAAT TAATCGCTTG ATCTCAGAAC AAACACTAGA CAAAATTTCA
GGCATTCATC ATGATTTATT TGATGACCCC TCTTTTCAAC AAATTCTTGC TGATGATGAG
ATAATCAGTG CCCTTGGTTT GATGTTATTT ACCACAATGG CCTGGGAGAA GGCATTGACT
AATATACGCA AGCAAATATG CCTTTCAATT CAAAATAATG GTTTTGATAA AAGGATTATT
GATTTAACTA TTGCTCTTGC GGAGCAGTGT TTCCTAAATG AATATGTTTT TACATTTACC
AAGCAAGAAT CTGATGCAAT AGAACAGTTT AAACTTTCCT ACTTGAGAAG TGATTTTGAT
TTAAAAACTC TTTCTATCTT AGCTTGTTAT ATACCCATTA CACATCTTTC TGAACAGTTC
CCTTTATTAA GAGGATTCAT CGATGAAAAT GAGAAATTAA ATAATTTAAA AATTATGCAA
CTCGTCGAAC CCGAACGTGA ACATGATTTT GCAGTATCTA TTCCTAAATA TGGATCTATA
GATGATGGTA CTTCTATTCA GGTTAAAAAG CAGTATGAAG AGCATCCTTA TCCGCGTTGG
AGATATGCTT CTTATTCAAG TGAAAATATT CAAACTATAT CTTCAGCGAT TAATAATGAA
ATAAACCCTA ATCGGATCTC AATCATTTTG CCCAAGCAGA GATCTCGCGT CTTAATAGCT
GGCTGTGGAA CCGGACAGCA AATATTCGAT GCCCTCTCTT ATTCTAACTC TGATTTAACT
GCTATCGATC TCAGTTCTTC TAGTATCGCT TATGCAAAGC GTAAGGCTCA TGAATATGGA
ATTGAGCATA TAAGATTTAT AGAGATGGAT ATTCTTGATC TTCCAAAACT AAATGAAGAA
TTTGATCTCA TAGAATGTAC GGGAGTTCTC CATCATATGA AGGACCCATC TGAAGGTCTG
CAATCTCTAC TTACAATACT CGCATCCGAT GGAATGCTCA AATTAGGTTT TTATAGTGAG
CTAGCACGTC AAGATATTGT CGAGGCTAGG AAAATTATTA AATCAGAATC TTTTGAGGCT
AGTAATGAAG GCATTTGTCT ATTTAGGGAT AAACTAATCA ACGGTGAATA TCCGAACATT
AGTTCAATCT CAAACTGGCC AGACTTCTAC ACAACTTCTA TGTGTAGAGA CCTTTGTTTT
CATATTATGG AGCATCGTTA CTCACTAGTG ATGATAGCAT CGCTGCTTGA ACAATTTGAA
CTTAGATTCT TAGGTTTTGT GTTGCCAAGT ATTGTTAAAA AAGATTATGC CAGGGCTTAT
CCATCTGATG CAATGCAGAC TGACCTCGGC TATTGGCACG AGTATGAACA AGCCAATCCC
AATACCTTTC GACAAATGTA CCAGTTCTGG ACGAATAGAA AGAAGATAAT GTATTACGGT
TAA
 
Protein sequence
MKGFGEKKKI KNKQPSKKFA RIPPDQLKAI AFKYHQQGNI NEAQKAYQEF INSGLSDPDV 
FSNFALICQS QGEIDKAIKV YKKSVKLFPG HAFSHANLGY LLFQIGMLDD AEVAIRQAIV
IQPNLANAYS YLGLVLREKG RLTDAEDITR KAIELQPDLA DAYVNLGQIL QNQGKLDEAE
HTTRKAIELQ DDSASIYLNL GGILQDQGNL TDAEANTRKA MNLQADLPDV NLNLSIILKD
LGRLEEAVFH LTREIELYPQ NQSSYLLLNS LLEESDLSFL PERQSRILLR GLLKRNDIAH
KNLFSAINRL ISEQTLDKIS GIHHDLFDDP SFQQILADDE IISALGLMLF TTMAWEKALT
NIRKQICLSI QNNGFDKRII DLTIALAEQC FLNEYVFTFT KQESDAIEQF KLSYLRSDFD
LKTLSILACY IPITHLSEQF PLLRGFIDEN EKLNNLKIMQ LVEPEREHDF AVSIPKYGSI
DDGTSIQVKK QYEEHPYPRW RYASYSSENI QTISSAINNE INPNRISIIL PKQRSRVLIA
GCGTGQQIFD ALSYSNSDLT AIDLSSSSIA YAKRKAHEYG IEHIRFIEMD ILDLPKLNEE
FDLIECTGVL HHMKDPSEGL QSLLTILASD GMLKLGFYSE LARQDIVEAR KIIKSESFEA
SNEGICLFRD KLINGEYPNI SSISNWPDFY TTSMCRDLCF HIMEHRYSLV MIASLLEQFE
LRFLGFVLPS IVKKDYARAY PSDAMQTDLG YWHEYEQANP NTFRQMYQFW TNRKKIMYYG
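
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a hedged sketch rather than the tool used to produce this record: it builds the codon table by hand, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAAAGGAT TTGGAGAAAA GAAGAAAATA AAAAATAAAC AACCCTCCAA GAAGTTTGCC"
print(translate(first_line))  # MKGFGEKKKIKNKQPSKKFA
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.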