Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4133 |
Symbol | |
ID | 5705578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4696820 |
End bp | 4697671 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273561 |
Product | hypothetical protein |
Protein accession | YP_001538914 |
Protein GI | 159039661 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1 [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.242281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.352812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA GCAACGGGCA GGAGGCGGAC GGTGTCCTCC GGCGGGATCG ACACCGGGGC GCCGGGCTAC TGCGTCGGTC GACGGTCGGC GGGAGTACCG CCGACCAGCG GCTGCTCGAC TCCCCCCAGC GCAGCGACTG GAAGACCCGG GACGCCTGGC GGGCGCTGCG GATCCTCTCC GAGTTCGTCG AGGGCTTCGA CACCCTGTCC GACCTGCCGT CAGCGGTCAG CGTCTTCGGT TCGGCGCGGA GCCGACCGGA CAGCCCGGAG TGCCGGATGG CCGAGGCGCT GGGCGGTGCA CTGGCCCGTG CCGGATACGC GGTCATCACC GGCGGGGGCC CGGGGGTGAT GGCGGCGGCG AACCGGGGAA CCAGGGAAGC CGGCGGGCTC TCCGTCGGCC TGGGCATCGA GCTCCCCTTC GAGCAGGGCA TCAACGACTG GGTCGATCTG GCGATCGAGT TCCGGTACTT CTTCGCGCGA AAGACCATGT TCGTCAAGTA CGCCCAGGCG TTCGTGGTGC TCCCCGGGGG CTTCGGCACG ATGGACGAGC TGTTCGAGGC CCTCACCCTG GTGCAGACCG GCAAGGTGAC CCGGTTCCCG GTGGTGCTGA TGGGTGTCGA CTACTGGCGC GGCCTACTCG ACTGGCTGCG GGACACGATG GTGGCCGACG GCAAGATCGG GGCGATCGAT CTCGACCTGA TCTGCCTCAC CGACGACGTG AACACGGCGG TGCGGCACAT CGTCGAGGCC GAGGCGCTGC TCTCCGCCGA CCAGGAGGCC GTCCGTGAGG AGGCGGTCGC TGTCGCCGCC GCCGAACGGC GGGCCGCCGC CGACGAGGGG GGTCGGGGCT GA
|
Protein sequence | MSESNGQEAD GVLRRDRHRG AGLLRRSTVG GSTADQRLLD SPQRSDWKTR DAWRALRILS EFVEGFDTLS DLPSAVSVFG SARSRPDSPE CRMAEALGGA LARAGYAVIT GGGPGVMAAA NRGTREAGGL SVGLGIELPF EQGINDWVDL AIEFRYFFAR KTMFVKYAQA FVVLPGGFGT MDELFEALTL VQTGKVTRFP VVLMGVDYWR GLLDWLRDTM VADGKIGAID LDLICLTDDV NTAVRHIVEA EALLSADQEA VREEAVAVAA AERRAAADEG GRG
|
| |