Gene P9303_22061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_22061 
Symbolsmf 
ID4778027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1960384 
End bp1961520 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content57% 
IMG OID640087722 
ProductSMF family protein 
Protein accessionYP_001018206 
Protein GI124023899 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACTA TTTTGCTTGT GGAGCGTCGG CATTGGTGGT GGCTTTGGAG CCGCTGTCCA 
GGGATTGGAG CTGCTCGAAT GGGAGACCTC GAGGCGCTAG GCAAGGCTCA CGAGGTCAGT
CTGGCTGAGC TATGGACTTG GCCTGAGGAA AGGTTGCGCA AGGTGCTCTT CTGGCCGACA
GCCTTGTTCA AAGACTTGGG CCTCCACCGC AGCAAGTGGG GAACTTGTCC AAGTGTCGAC
GTGCCGGAGG ATGTCTTGAT GCCTGTGGAT TTGCTTTGGC CGGAAGGGCT TCGTGCCTTG
AAGCGCCCTC CTTTGGCGCT TTTTTGGCAG GGCCGGCAGG AGCTTTTGGG ATGCCTTGGG
GCCCGTAGGG CGGTAGCAAT TGTTGGCACA CGACGGCCTT CGAACCATGG TTTGCGTGTG
GCTGAAGCAT TGGGTCGTGC TTTGGCACTA GCGGGTTGGC CTGTGATTAG TGGCCTTGCA
GAAGGGATTG ATGCGGCTGC CCATCGCGGC TGTTTGGAAG GCGGTGGTGC GCCTGTGGGC
GTGCTGGGCA CACCTTTGCA GAAGGTCTAT CCCAGGCAGA ATGAGGGCCT TCAAGCTCTG
GTCGCGGCTC AAGGGCTGCT AGTCACAGAG CAGCCCAGGG AGACTTTGGT CAAGCGCGGT
TGTTTTGCAG CCCGTAATCG CTTATTGGTG GCCTTGGCAA AGGCTGTGGT CGTCGTCGAG
TGCCCCGAGA GAAGTGGAGC CTTGATTACA GCGCGGCGGG CAATAGAGCA GCAATGTCAG
CTGTTGGTCG TGCCCGGTGA TGCAAGGCGA TGGTCGGCCC TCGGGAGCAA TGCTTTGTTG
TTGGATCAGG CTTCCCCTTT GCTAAGCCCT GAAGCTCTTG TAAAACAACT TGGTACTGGT
CCGCTGGCGG TTCATTCTCC TTCGGTTGCT TTTGATTTAT CTGGTTCTCG CTCTAGCTCA
CGAGCCGGCC AGCATGGCGA TACAGCACTG TTACAGGCCA TTGGCGATGG TGCATCCCTG
GAGGATTTGA TGACCGGTTT GAATCTGTCT TCGGCGCGCT TGACAGAACA ATTGCTTCAG
TTGGAGTTGA AGGGTGTTGT GGTGGCAGAG CCTGGTTTGC ATTGGCGTTT GGCCTAG
 
Protein sequence
MRTILLVERR HWWWLWSRCP GIGAARMGDL EALGKAHEVS LAELWTWPEE RLRKVLFWPT 
ALFKDLGLHR SKWGTCPSVD VPEDVLMPVD LLWPEGLRAL KRPPLALFWQ GRQELLGCLG
ARRAVAIVGT RRPSNHGLRV AEALGRALAL AGWPVISGLA EGIDAAAHRG CLEGGGAPVG
VLGTPLQKVY PRQNEGLQAL VAAQGLLVTE QPRETLVKRG CFAARNRLLV ALAKAVVVVE
CPERSGALIT ARRAIEQQCQ LLVVPGDARR WSALGSNALL LDQASPLLSP EALVKQLGTG
PLAVHSPSVA FDLSGSRSSS RAGQHGDTAL LQAIGDGASL EDLMTGLNLS SARLTEQLLQ
LELKGVVVAE PGLHWRLA