Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_22061 |
Symbol | smf |
ID | 4778027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1960384 |
End bp | 1961520 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087722 |
Product | SMF family protein |
Protein accession | YP_001018206 |
Protein GI | 124023899 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTACTA TTTTGCTTGT GGAGCGTCGG CATTGGTGGT GGCTTTGGAG CCGCTGTCCA GGGATTGGAG CTGCTCGAAT GGGAGACCTC GAGGCGCTAG GCAAGGCTCA CGAGGTCAGT CTGGCTGAGC TATGGACTTG GCCTGAGGAA AGGTTGCGCA AGGTGCTCTT CTGGCCGACA GCCTTGTTCA AAGACTTGGG CCTCCACCGC AGCAAGTGGG GAACTTGTCC AAGTGTCGAC GTGCCGGAGG ATGTCTTGAT GCCTGTGGAT TTGCTTTGGC CGGAAGGGCT TCGTGCCTTG AAGCGCCCTC CTTTGGCGCT TTTTTGGCAG GGCCGGCAGG AGCTTTTGGG ATGCCTTGGG GCCCGTAGGG CGGTAGCAAT TGTTGGCACA CGACGGCCTT CGAACCATGG TTTGCGTGTG GCTGAAGCAT TGGGTCGTGC TTTGGCACTA GCGGGTTGGC CTGTGATTAG TGGCCTTGCA GAAGGGATTG ATGCGGCTGC CCATCGCGGC TGTTTGGAAG GCGGTGGTGC GCCTGTGGGC GTGCTGGGCA CACCTTTGCA GAAGGTCTAT CCCAGGCAGA ATGAGGGCCT TCAAGCTCTG GTCGCGGCTC AAGGGCTGCT AGTCACAGAG CAGCCCAGGG AGACTTTGGT CAAGCGCGGT TGTTTTGCAG CCCGTAATCG CTTATTGGTG GCCTTGGCAA AGGCTGTGGT CGTCGTCGAG TGCCCCGAGA GAAGTGGAGC CTTGATTACA GCGCGGCGGG CAATAGAGCA GCAATGTCAG CTGTTGGTCG TGCCCGGTGA TGCAAGGCGA TGGTCGGCCC TCGGGAGCAA TGCTTTGTTG TTGGATCAGG CTTCCCCTTT GCTAAGCCCT GAAGCTCTTG TAAAACAACT TGGTACTGGT CCGCTGGCGG TTCATTCTCC TTCGGTTGCT TTTGATTTAT CTGGTTCTCG CTCTAGCTCA CGAGCCGGCC AGCATGGCGA TACAGCACTG TTACAGGCCA TTGGCGATGG TGCATCCCTG GAGGATTTGA TGACCGGTTT GAATCTGTCT TCGGCGCGCT TGACAGAACA ATTGCTTCAG TTGGAGTTGA AGGGTGTTGT GGTGGCAGAG CCTGGTTTGC ATTGGCGTTT GGCCTAG
|
Protein sequence | MRTILLVERR HWWWLWSRCP GIGAARMGDL EALGKAHEVS LAELWTWPEE RLRKVLFWPT ALFKDLGLHR SKWGTCPSVD VPEDVLMPVD LLWPEGLRAL KRPPLALFWQ GRQELLGCLG ARRAVAIVGT RRPSNHGLRV AEALGRALAL AGWPVISGLA EGIDAAAHRG CLEGGGAPVG VLGTPLQKVY PRQNEGLQAL VAAQGLLVTE QPRETLVKRG CFAARNRLLV ALAKAVVVVE CPERSGALIT ARRAIEQQCQ LLVVPGDARR WSALGSNALL LDQASPLLSP EALVKQLGTG PLAVHSPSVA FDLSGSRSSS RAGQHGDTAL LQAIGDGASL EDLMTGLNLS SARLTEQLLQ LELKGVVVAE PGLHWRLA
|
| |