Gene Sare_4552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4552 
Symbol 
ID5705814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5144395 
End bp5147526 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content74% 
IMG OID641273964 
Producttranscriptional regulator 
Protein accessionYP_001539311 
Protein GI159040058 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.01867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTCG GGATGCTCGG CCCTCTGCTG GTGACCGCCG GCGGAACCGA GGTGCGGATC 
GGTGGTGCCC GGCTGCGCAC CCTGCTGATC CGGCTGGCGC TGGAACCCGG GCGGCCCGTG
CCGACCGAGT CGCTGACCCG GTCACTGTGG CCGGAGGACC GGCTCACCGA CACCTCGCAC
GCGCTGCATG CACTCGTCTC ACGGCTGCGC AGGTCACTGC CAGAGCCCGC CGTGGTGGAG
GGCATCCCAG GCGGATACCG GCTGCGGCTG CCCCCGGCAT CGGTCGACGT CACGCACTTC
GAGCAGCTGC GACAGGAGGG TCAGCGCCGG CTCCGTGAGG GCGACCCGGC GCACGCCGGC
CGGATGCTCC GGGAAGCGCT CGCCCTGTGG CGCGGCGAGC CGCTCGCCGA CGTGCGGGAC
CTGCCGTTCG CGGCGCAGGA AGTCAACCGG CTCACCGAAC TGCGGCTCAC CGCGCTGGAG
GACCGCGTCG CCGCCGACCT GGCGTGCGGC GCCGACGACC TGGTCGCGGA ACTGCAGGGG
CTGACGGCGA GCTACCCGTC CCGGGAGCGG CTGCATGCGC TGCTGGTCCG CGCCCTGCAC
GCGGAGGGCC GCCAGTCGGA GGCGCTGCGC ACCTACGCCG GCTATCGGCG CTACCTCGCT
GATCAGCTCG GCAGCGACCC TGGGCCCGAA CTGCGCGCCG CTCACCTGGC GGTGCTGCGG
GACGACCGCG GCACCGAGCG GTCCCGGGGC AACCTGGGTG CACCGCTGAC GTCGTTCGTC
GGCCGGGCCG CGGAACGCCG CCGGATCCAC GAGCAGCTGC GCGAGCAGCG CCTGGTGACG
CTGGTCGGCA CCGGCGGCGT GGGCAAGACC CGGCTGGCGA CCACGCTGGC CGCCGAACTG
GCCGACCGGA CCTCGGACGG CGTCTGGCTG ATCTCGCTGG CCACGGCAAC CGCCGCCACC
GACGTGCCGC AGACGATGCT CCACACCCTG GGGGTACGTC CCGCCGACCG GTCCGCCGAC
CCGGTACGCG CACTGGTCGC CGCGCTGGCG CCGACCGAGA CCGTGCTCAT CATGGACAAC
TGCGAGCATG TCATCGAGGC AGCGGCCCGG GTCGTCGAGC AACTTCTGGT GGGCTGCCCG
CGGCTGCGGA TCGTGGCGAC CAGCCGGGAA CCGCTGATGA TCCCGGGCGA GGCCCTGAGC
CCGGTGCCGC CGCTGCCGGT GCCGCCATCG GGGACGCCGC TGCCCAAGGC GCTGGATTCT
CCCGCGGTGC GGCTGCTCGT CGAGCGCGCC CGCGCGGCAC ACCCCGCGTT CGCCGTGACC
GAGAAGAACA TCGGGCACAT CGTGGAGACC TGCCGCCGGC TGGACGGCCT GCCGCTGGCC
ATCGAGCTGG CCGCCGCCCG GCTGCGGTCC ATGTCGATCG AGCACCTGGC CGCCCGCCTG
GACGACCGGT TCCGTCTACT CACCGGTGGC AGCCGGACCG CGCTGTCCCG TCACCAGACC
CTGCACGCCG CAGTGACCTG GAGCTGGGAC CTGCTGAGCG AGCCGGAGCG GCGGGCGCTC
CGCAGCGTGG CGGTCTTCTC GGGCTCGTTC GACGCCGCGG CGGCCGAGTC ACTCGGAGTC
GCGACGGAAC TGCTCGACGC CCTCTTCGAC CGGTCGCTCA TCACCCTGAT CGACGGCCCC
GAACCCCGCT ACGCCGTGCT GGAGACGATC CGGGAGTACG CCCTGCAGCA CCTGACCGAA
GCCGGCGAGG TGCTCCGGAT GCGACACGAC CACGCGGCAC ACTTCCTGGC GTTGGCAGAG
CAGGCGGCAC CACATCTGCG CGGCCCGCGA CAGCACCCGT GGATGCTGCG GCTCGACGCG
GAGAGCGGCA ATCTGCTGGC GGCCCTGCGC TTCGCGACCG ACTTCGGCGA CGCGGACACC
GCGGTCCGGA TGGCTGCCGC CCTCTGGTAC GCCTGGGTAG TCAACAGCGA GCACACCGAG
GCGGTCGAGC GGCTGCGCCG AGCCCTGGCG ATGCCCGGCC CGGTGCGGGC GCACGCCCGC
CGTACTGCTG CGATCGGCCT GCTCTTCAGC AGCGTCCTCG GCGGCGACCG GGAGGCAATG
CGGGACGCCC GGCGCCGGGT GCTCGACGAC GGCACACTGC CGCCGGCGGA CCCCCTGGCC
ACGGCGCTGC TGGCGGTGAC CTCCGACGAC CCCGCCCCGG TGTTCGCCGC CGACGGGCCG
GAGACCGACC CGTGGGAGCG GGGTCTGCTC TGGTGGATCA GGTCGTTCCT CAGCGCGAGG
CGGGGTGAAG CCGCCACGCT GTGCGACGCC CTCACCCGCG CCGAGGACGG ATTCCGCCGG
GCCGGGGACC GCTGGGCGCT CGCCATGTGC CTGCTGAGCA CGAGCGACGC CCGGCTGACG
GTCGGCGATC TGGACGCCAG CCTGCGTGCT CTGGAGGAGT CGACGGAACT GGCGCACGGC
TTGGGCACTA ACGACCAGCA GCGACTCTGG CTGGCGGTCG TTCGGCTGCG CAGCGCCGAC
GTCCGCGGGG CCCGCGCCGA GCTGCTGAGC ATCGTGGAGC AGGCGTCGGC CGGCCGTTAC
GCGTCCACCG CCCGGATCTT CCTGGCCGAC CTGTGCCGCC AGGAGGGCGA CTTGGACGCC
GCCGCTCGCC AGCTGGAGCA CGCCGCCAAC GACCGCGGGG CCCAGCAGGA CCGGGTTTTC
CGGTCGCTGT ACCGGTTGTC GGCCGGCCAC CTGGCCGTGG CCCGCGGCGA CCTGCGCGGC
GCGGCGCGGG ACCTGCGCGA GGGTCTGGAC CTGATCGCGG CGATGCCGCA CGTGCCGATG
GGTGCCACGG TCGGTGCCGG CGTCGCGGCC CTGCTGTTGC GTGCCGGTTC GCCGGCGTCG
GCGGCCCAGG TGCTCGGCGC CGGCCGCGCA CTGACCGGTG CGGCCAACGC CGACGTCCTG
CGCCTCGAGG AAGAGCTCGG CGAACAGCTG GGCACGAGCG GGTATGCGGA CGCCTGCCGC
CTGGAACCCC CCGCCGCCCT GGCCCTCATC CAGCAGAGTC TCGCCGCCTT CACTCCCGAC
GCTGGTAGGC GAGCACGGCC AACGGGCAGA AGATCACGAG CAGGACAGCC GACCAGATCA
AGGTCTGGGT GA
 
Protein sequence
MHVGMLGPLL VTAGGTEVRI GGARLRTLLI RLALEPGRPV PTESLTRSLW PEDRLTDTSH 
ALHALVSRLR RSLPEPAVVE GIPGGYRLRL PPASVDVTHF EQLRQEGQRR LREGDPAHAG
RMLREALALW RGEPLADVRD LPFAAQEVNR LTELRLTALE DRVAADLACG ADDLVAELQG
LTASYPSRER LHALLVRALH AEGRQSEALR TYAGYRRYLA DQLGSDPGPE LRAAHLAVLR
DDRGTERSRG NLGAPLTSFV GRAAERRRIH EQLREQRLVT LVGTGGVGKT RLATTLAAEL
ADRTSDGVWL ISLATATAAT DVPQTMLHTL GVRPADRSAD PVRALVAALA PTETVLIMDN
CEHVIEAAAR VVEQLLVGCP RLRIVATSRE PLMIPGEALS PVPPLPVPPS GTPLPKALDS
PAVRLLVERA RAAHPAFAVT EKNIGHIVET CRRLDGLPLA IELAAARLRS MSIEHLAARL
DDRFRLLTGG SRTALSRHQT LHAAVTWSWD LLSEPERRAL RSVAVFSGSF DAAAAESLGV
ATELLDALFD RSLITLIDGP EPRYAVLETI REYALQHLTE AGEVLRMRHD HAAHFLALAE
QAAPHLRGPR QHPWMLRLDA ESGNLLAALR FATDFGDADT AVRMAAALWY AWVVNSEHTE
AVERLRRALA MPGPVRAHAR RTAAIGLLFS SVLGGDREAM RDARRRVLDD GTLPPADPLA
TALLAVTSDD PAPVFAADGP ETDPWERGLL WWIRSFLSAR RGEAATLCDA LTRAEDGFRR
AGDRWALAMC LLSTSDARLT VGDLDASLRA LEESTELAHG LGTNDQQRLW LAVVRLRSAD
VRGARAELLS IVEQASAGRY ASTARIFLAD LCRQEGDLDA AARQLEHAAN DRGAQQDRVF
RSLYRLSAGH LAVARGDLRG AARDLREGLD LIAAMPHVPM GATVGAGVAA LLLRAGSPAS
AAQVLGAGRA LTGAANADVL RLEEELGEQL GTSGYADACR LEPPAALALI QQSLAAFTPD
AGRRARPTGR RSRAGQPTRS RSG