Gene Information |
Locus tag | Sare_4540 |
Symbol | |
ID | 5705981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5132856 |
End bp | 5134376 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273954 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001539303 |
Protein GI | 159040050 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0242] N-formylmethionyl-tRNA deformylase |
TIGRFAM ID | [TIGR00079] peptide deformylase |
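
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes two things the record does not state explicitly: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record for Sare_4540.
start_bp, end_bp = 5132856, 5134376

length_bp = gene_length_bp(start_bp, end_bp)
print(length_bp)                      # 1521, matches "Gene Length"
print(protein_length_aa(length_bp))   # 506, matches "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields in the record.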
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.129106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00302013 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAACCT CACCGCTCGA TCGGGCTGCC GATTCCTTCG CCGTCGAACT CGCCCGCCAC CGGACCGGGC GAGGGCTGTC CAAGAAACAG CTGGCGACCC TGATGGGATT CGACCCCTCG TACGTCAGCC ACGTCGAGGG ACGTCGGCAC CGTCCCACCG AGGACTTCGC CCGGCGGGCG GAGGCCGTAC TGGAAGCCAG CGGGGCGATC TGGCAGCGCT TCCGCGACTA CGACGAACTC CGCCACACCC GCGGCGACCG GGCGCCGCAC CGCGAGCCAC CCGCCCCCGG GCAGTGGCTG CCCCCCGGTA CCGGCCTGAT CGTCGAACGG GAACAGGCCA CCCTCTCCTA CCGGGACGAG ACATACCGCT GCGTCATCCG TCGTGAGCTG TACAACGCGG GGAGCGAGCC GATCACCCGC TACCTCGTAC GTGTCGCCGT TGACCGATAC CCCGGCGACC CCGGCCGCTC CAACCGGCAT CACCGGGAAC ACCCGCTCAC CTTCGCCGAG CTACAGCTCC ACGCGCACCG ACTGGACGGA CACGAGCGTG AGGCGATGCA CTGGCGAACC AAGCACGACC GAGACGCGTT CAAGGAGATC TGGCTGCTCT TCGAGAACAC CGGCCGGCGC TTCCCGCTCT ACCCGGGCGA TCGCGCCACC ATCGAGTACG CCTACCACGT CGGGCCGGAC AAGTGGGGCC CGTGGTTCCA ACGGGCCGTC CGGGTGCCCA CCCGGCACCT CGCCGTGCGT CTCGACCTGC CGGCCGCGCT CGACCCACAG GTCTGGGGCG CAGAGACCTC ACTCTCCGCG GAGGAAGGCC CGCTGCGCAC CCCGGTGGCC CGCCGAGAGG AGGACGACCG CATCATCTTC GACTGGGCAG TCGACCAGCC GCCGCTGAAC GGCCGCTACC GGATGCAGTG GCGGTTCCGG GCCCAGCCCG AGGGAGAACT GACAGAAACC GGCTGGCTGC GGCCAAGCGA CCGGATGCGC GGCCTCGGCA TCCTCCAGCA CGGTGCCGAC CTACTCCGCC AGCCCACCCG ACCCTTCGAC CTGCCGCGCG AGGACCGAGC TGCCCGGGAC GTCGTCGACC GGCTCACGGC TACCCTGTTC CGCCTCGACG AGCTGCACCC CTTCAGCAAA GGGGTCGGGA TCGCCGCCCC CCAACTCGGC ATCGGCCGGG CCGCCGCCGT CGTCCGGCCC CCGGACCTGT CCGGCGAACC CGTCGTCCTG CTCAACCCGA GGGTGGTCGA CGCCGCACCC GACACCGACG AACAGTACGA AGGCTGCCTC TCCTTCTTCG ACCAACGGGG CCTCGTGCCC CGCCCGCTGC GAATCGACGT CGAGCACACC CACATCGACG GCAGCCGGGT CATCACCTCA TACGAGTACG GCATGGCGCG ACTCGTGGCA CACGAGATCG ACCATCTCGA AGGGCGCCTC TACGTCGACC GCATGGCCCC GGGCGTGCCC CTGGTGCCGG TCGAGGAATA CCGGCACACC GGACAGCCCT GGCGCTACTG A
|
Protein sequence | MTTSPLDRAA DSFAVELARH RTGRGLSKKQ LATLMGFDPS YVSHVEGRRH RPTEDFARRA EAVLEASGAI WQRFRDYDEL RHTRGDRAPH REPPAPGQWL PPGTGLIVER EQATLSYRDE TYRCVIRREL YNAGSEPITR YLVRVAVDRY PGDPGRSNRH HREHPLTFAE LQLHAHRLDG HEREAMHWRT KHDRDAFKEI WLLFENTGRR FPLYPGDRAT IEYAYHVGPD KWGPWFQRAV RVPTRHLAVR LDLPAALDPQ VWGAETSLSA EEGPLRTPVA RREEDDRIIF DWAVDQPPLN GRYRMQWRFR AQPEGELTET GWLRPSDRMR GLGILQHGAD LLRQPTRPFD LPREDRAARD VVDRLTATLF RLDELHPFSK GVGIAAPQLG IGRAAAVVRP PDLSGEPVVL LNPRVVDAAP DTDEQYEGCL SFFDQRGLVP RPLRIDVEHT HIDGSRVITS YEYGMARLVA HEIDHLEGRL YVDRMAPGVP LVPVEEYRHT GQPWRY