Gene Information |
Locus tag | Sare_4552 |
Symbol | |
ID | 5705814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5144395 |
End bp | 5147526 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641273964 |
Product | transcriptional regulator |
Protein accession | YP_001539311 |
Protein GI | 159040058 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.01867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGTCG GGATGCTCGG CCCTCTGCTG GTGACCGCCG GCGGAACCGA GGTGCGGATC GGTGGTGCCC GGCTGCGCAC CCTGCTGATC CGGCTGGCGC TGGAACCCGG GCGGCCCGTG CCGACCGAGT CGCTGACCCG GTCACTGTGG CCGGAGGACC GGCTCACCGA CACCTCGCAC GCGCTGCATG CACTCGTCTC ACGGCTGCGC AGGTCACTGC CAGAGCCCGC CGTGGTGGAG GGCATCCCAG GCGGATACCG GCTGCGGCTG CCCCCGGCAT CGGTCGACGT CACGCACTTC GAGCAGCTGC GACAGGAGGG TCAGCGCCGG CTCCGTGAGG GCGACCCGGC GCACGCCGGC CGGATGCTCC GGGAAGCGCT CGCCCTGTGG CGCGGCGAGC CGCTCGCCGA CGTGCGGGAC CTGCCGTTCG CGGCGCAGGA AGTCAACCGG CTCACCGAAC TGCGGCTCAC CGCGCTGGAG GACCGCGTCG CCGCCGACCT GGCGTGCGGC GCCGACGACC TGGTCGCGGA ACTGCAGGGG CTGACGGCGA GCTACCCGTC CCGGGAGCGG CTGCATGCGC TGCTGGTCCG CGCCCTGCAC GCGGAGGGCC GCCAGTCGGA GGCGCTGCGC ACCTACGCCG GCTATCGGCG CTACCTCGCT GATCAGCTCG GCAGCGACCC TGGGCCCGAA CTGCGCGCCG CTCACCTGGC GGTGCTGCGG GACGACCGCG GCACCGAGCG GTCCCGGGGC AACCTGGGTG CACCGCTGAC GTCGTTCGTC GGCCGGGCCG CGGAACGCCG CCGGATCCAC GAGCAGCTGC GCGAGCAGCG CCTGGTGACG CTGGTCGGCA CCGGCGGCGT GGGCAAGACC CGGCTGGCGA CCACGCTGGC CGCCGAACTG GCCGACCGGA CCTCGGACGG CGTCTGGCTG ATCTCGCTGG CCACGGCAAC CGCCGCCACC GACGTGCCGC AGACGATGCT CCACACCCTG GGGGTACGTC CCGCCGACCG GTCCGCCGAC CCGGTACGCG CACTGGTCGC CGCGCTGGCG CCGACCGAGA CCGTGCTCAT CATGGACAAC TGCGAGCATG TCATCGAGGC AGCGGCCCGG GTCGTCGAGC AACTTCTGGT GGGCTGCCCG CGGCTGCGGA TCGTGGCGAC CAGCCGGGAA CCGCTGATGA TCCCGGGCGA GGCCCTGAGC CCGGTGCCGC CGCTGCCGGT GCCGCCATCG GGGACGCCGC TGCCCAAGGC GCTGGATTCT CCCGCGGTGC GGCTGCTCGT CGAGCGCGCC CGCGCGGCAC ACCCCGCGTT CGCCGTGACC GAGAAGAACA TCGGGCACAT CGTGGAGACC TGCCGCCGGC TGGACGGCCT GCCGCTGGCC ATCGAGCTGG CCGCCGCCCG GCTGCGGTCC ATGTCGATCG AGCACCTGGC CGCCCGCCTG GACGACCGGT TCCGTCTACT CACCGGTGGC AGCCGGACCG CGCTGTCCCG TCACCAGACC CTGCACGCCG CAGTGACCTG GAGCTGGGAC CTGCTGAGCG AGCCGGAGCG GCGGGCGCTC CGCAGCGTGG CGGTCTTCTC GGGCTCGTTC GACGCCGCGG CGGCCGAGTC ACTCGGAGTC GCGACGGAAC TGCTCGACGC CCTCTTCGAC CGGTCGCTCA TCACCCTGAT CGACGGCCCC GAACCCCGCT ACGCCGTGCT GGAGACGATC CGGGAGTACG CCCTGCAGCA CCTGACCGAA GCCGGCGAGG TGCTCCGGAT GCGACACGAC CACGCGGCAC ACTTCCTGGC GTTGGCAGAG 
CAGGCGGCAC CACATCTGCG CGGCCCGCGA CAGCACCCGT GGATGCTGCG GCTCGACGCG GAGAGCGGCA ATCTGCTGGC GGCCCTGCGC TTCGCGACCG ACTTCGGCGA CGCGGACACC GCGGTCCGGA TGGCTGCCGC CCTCTGGTAC GCCTGGGTAG TCAACAGCGA GCACACCGAG GCGGTCGAGC GGCTGCGCCG AGCCCTGGCG ATGCCCGGCC CGGTGCGGGC GCACGCCCGC CGTACTGCTG CGATCGGCCT GCTCTTCAGC AGCGTCCTCG GCGGCGACCG GGAGGCAATG CGGGACGCCC GGCGCCGGGT GCTCGACGAC GGCACACTGC CGCCGGCGGA CCCCCTGGCC ACGGCGCTGC TGGCGGTGAC CTCCGACGAC CCCGCCCCGG TGTTCGCCGC CGACGGGCCG GAGACCGACC CGTGGGAGCG GGGTCTGCTC TGGTGGATCA GGTCGTTCCT CAGCGCGAGG CGGGGTGAAG CCGCCACGCT GTGCGACGCC CTCACCCGCG CCGAGGACGG ATTCCGCCGG GCCGGGGACC GCTGGGCGCT CGCCATGTGC CTGCTGAGCA CGAGCGACGC CCGGCTGACG GTCGGCGATC TGGACGCCAG CCTGCGTGCT CTGGAGGAGT CGACGGAACT GGCGCACGGC TTGGGCACTA ACGACCAGCA GCGACTCTGG CTGGCGGTCG TTCGGCTGCG CAGCGCCGAC GTCCGCGGGG CCCGCGCCGA GCTGCTGAGC ATCGTGGAGC AGGCGTCGGC CGGCCGTTAC GCGTCCACCG CCCGGATCTT CCTGGCCGAC CTGTGCCGCC AGGAGGGCGA CTTGGACGCC GCCGCTCGCC AGCTGGAGCA CGCCGCCAAC GACCGCGGGG CCCAGCAGGA CCGGGTTTTC CGGTCGCTGT ACCGGTTGTC GGCCGGCCAC CTGGCCGTGG CCCGCGGCGA CCTGCGCGGC GCGGCGCGGG ACCTGCGCGA GGGTCTGGAC CTGATCGCGG CGATGCCGCA CGTGCCGATG GGTGCCACGG TCGGTGCCGG CGTCGCGGCC CTGCTGTTGC GTGCCGGTTC GCCGGCGTCG GCGGCCCAGG TGCTCGGCGC CGGCCGCGCA CTGACCGGTG CGGCCAACGC CGACGTCCTG CGCCTCGAGG AAGAGCTCGG CGAACAGCTG GGCACGAGCG GGTATGCGGA CGCCTGCCGC CTGGAACCCC CCGCCGCCCT GGCCCTCATC CAGCAGAGTC TCGCCGCCTT CACTCCCGAC GCTGGTAGGC GAGCACGGCC AACGGGCAGA AGATCACGAG CAGGACAGCC GACCAGATCA AGGTCTGGGT GA
|
Protein sequence | MHVGMLGPLL VTAGGTEVRI GGARLRTLLI RLALEPGRPV PTESLTRSLW PEDRLTDTSH ALHALVSRLR RSLPEPAVVE GIPGGYRLRL PPASVDVTHF EQLRQEGQRR LREGDPAHAG RMLREALALW RGEPLADVRD LPFAAQEVNR LTELRLTALE DRVAADLACG ADDLVAELQG LTASYPSRER LHALLVRALH AEGRQSEALR TYAGYRRYLA DQLGSDPGPE LRAAHLAVLR DDRGTERSRG NLGAPLTSFV GRAAERRRIH EQLREQRLVT LVGTGGVGKT RLATTLAAEL ADRTSDGVWL ISLATATAAT DVPQTMLHTL GVRPADRSAD PVRALVAALA PTETVLIMDN CEHVIEAAAR VVEQLLVGCP RLRIVATSRE PLMIPGEALS PVPPLPVPPS GTPLPKALDS PAVRLLVERA RAAHPAFAVT EKNIGHIVET CRRLDGLPLA IELAAARLRS MSIEHLAARL DDRFRLLTGG SRTALSRHQT LHAAVTWSWD LLSEPERRAL RSVAVFSGSF DAAAAESLGV ATELLDALFD RSLITLIDGP EPRYAVLETI REYALQHLTE AGEVLRMRHD HAAHFLALAE QAAPHLRGPR QHPWMLRLDA ESGNLLAALR FATDFGDADT AVRMAAALWY AWVVNSEHTE AVERLRRALA MPGPVRAHAR RTAAIGLLFS SVLGGDREAM RDARRRVLDD GTLPPADPLA TALLAVTSDD PAPVFAADGP ETDPWERGLL WWIRSFLSAR RGEAATLCDA LTRAEDGFRR AGDRWALAMC LLSTSDARLT VGDLDASLRA LEESTELAHG LGTNDQQRLW LAVVRLRSAD VRGARAELLS IVEQASAGRY ASTARIFLAD LCRQEGDLDA AARQLEHAAN DRGAQQDRVF RSLYRLSAGH LAVARGDLRG AARDLREGLD LIAAMPHVPM GATVGAGVAA LLLRAGSPAS AAQVLGAGRA LTGAANADVL RLEEELGEQL GTSGYADACR LEPPAALALI QQSLAAFTPD AGRRARPTGR RSRAGQPTRS RSG
|
| |
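The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The record does not state either convention. The function names (`gene_length`, `protein_length`, `strip_groups`) are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon,
    minus one for the stop codon (assumed to be included in the CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_groups(seq: str) -> str:
    """Remove the whitespace separating the 10-character display groups."""
    return "".join(seq.split())


if __name__ == "__main__":
    n = gene_length(5144395, 5147526)
    print(n, protein_length(n))  # 3132 1043
```

Under these assumptions the computed values match the Gene Length (3132 bp) and Protein Length (1043 aa) fields. `strip_groups` can be applied to the Gene sequence and Protein sequence fields before measuring their lengths, since both are displayed in space-separated blocks of ten characters.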