Gene RoseRS_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2354 
Symbol 
ID5209323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2911204 
End bp2914416 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content61% 
IMG OID640595960 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001276682 
Protein GI148656477 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.607566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTGC GATTTCAGCA GAAGCTCGTT GTTCCAACGT CGGCGCGACC GCTGATCGAA 
CGACCGAACG TTCTCGCGCA GCTTGACCGT GCCATCCGGA GCAAGCGCGT CGTTGCGCTG
GCTGCGCCCG CCGGCTGGGG GAAAACGACT GCGCTGGCGC AGTGGGTTGC GCAGCGCACG
ATGCCGGTTG CCTGGTATAC CCTCGATAGC GCCGATCGCG ATCCGCACGT CTTCCTCGAC
TACCTGCTCC ACAGCGTTGC CGATTTTGTC CCAGGAGCGC CGGGTATCGC AGCGCGGCTC
GCCGAAACGA CCCCGCAAGG GCTGGCGGAG ATGATCCACC AGGTGGCGCT TGCATTTGCG
GATGCGCCTG AACATTTTGC GCTGATCCTC GATGATGTCC ATGTACTTGA CGATGACCAG
GCGCAATCCA TTCCGGGGGT ATCGCTCGTC TTTACGCTAC TCGCTTCGAT CGCGGAATAC
GCCTCCCACT GCCATCTGGT GCTCGCTTCG CGCACCCTGC CGGTGTTGCA TGGCATGGTA
CGCATGGTCG CACAGCAGCG CGCTGCGGTC TTCGATTATA GCGTGTTACA GTTCCATCGC
GACGACACGC AGCGTCTTGC TGGCATAACA AGCGGGTTGA TCCTCTCCGA CGAAGCAGCG
GAACAGTTGA CCGCATCAGT TGGCGGCTGG GTTACTGGCA TTGTCCTCTC GCTGGATCAA
CCGGTTGCGC ACAAAGACGG GATGACAGGT CAGCAGGTTG TTGATCGACA CCTGATGCAC
GTTCCCACAC CTGAAGAGGC GGTCATCGAA GCCAATACCA GTCAGGTATA CGCTTACTTT
GCCGAGCAGA TTCTGGCACC GTTGCCAGCG GATCTGCAAC GTTTCCTCGA AGACACCAGC
GTTCTGCACG ATCTATCGCC ACAGCGCTGC GATATACTGC GTTCTGCCGA TAACTCGGCA
GAATATCTCG ATGAGATCAG GCGGCGCGGG TTATTCGTCT CCAGCCGCGC CGGATGGCTC
TCGTATCACA GCCTGTTTCG CGATTATCTG CGCTCACGCC TGGCGCGCGA TCCGCAACGA
TGCCGGCAGT TGCTGCGTTC CGCCGGTGAC CTGTATGCTG CTGAAGATGA TATCGAGCGT
GCGCTTGACT GCTACCTGGC AGCCGGCGAC GATCAGCACG CTATCGATCT GATCCGTTCC
GCCGTGCCGC GCCTGCGGCA GTGTTCGCGC CAGACAACCC TGCTGACCTG CTTCGAACGT
CTGCACCGCG CGCGCAGGAC AAATGATCAG CGCACAGCCA GCGGAGGATT CCCGCCAGCA
ACACGCGCCG CACGCAAGCA GTCGATCCCG CCCGATCTGC TGCTGGCAGA GGCGCGTGTC
TACAGCGACC TGGCGCTCTG GGAACGTGCA TATCTGGCGC TTCACCTTGC CGGGGCGATC
GGGGATGCTT CTATCCGTGC CGAGGCGCAG ATCCTCTCTG CCGAGTTGCA GGTGCTCCAG
GGTGACTACG CCCGCGCTCA GCATGCGCTC AGATTCGTCG ATGTCAACAA CCTCGACAAT
CGATTGCGCC TGGAATACCA TGTCGCGGCT GCCCGCGCGC ATATTATGGC AGGAGAAGTT
GCCGCCGCCA TCACCGAACT CGAACGCGCG CATACGCTTG TCGCAACCCT GATCGACACC
GTCGATAACC CTGCCGCACT GGCGGATATT TACGACAACC TTGGCTGGGC ATACGCCGCG
CAGGGCGATC GTCCATCCGC CATTCGCTAT CTGAAACGCG CTGATGCCTG CTGGCAGTCT
TCCGGTAACC AGGGAAGACG CGCGCTGACC CTCAACAATA TGGGGGTGAT GGCGATGGAG
GAAGGGCGGT TTGCCGAAGC CCGCACAACA CTCGATCTGG GATTGGAGAT CGCCCGGCAG
ACAGAGTTAC GACGTGAAGA AACGGTTCTG CTGTGTAGTC TGGCAGAACT CGATCTGCGT
GAAGGCGATT TTGAACAGGC GATTCAGCGT TTTACCACAG CGCACGCCCT GGCAACCCGT
CTCGATATTG CCAGCAGCGT GGAAGCAGCC GCAGCGGGAG CGCTCTGGGC TGCCATACTT
TCCGACAACC TGACGCTCGC ACAGGCATGG CACGACACGG CAGCCTCCAT CGTGACACCG
CTACAACCGG AGGTGCGCGG ACGACTGGCA TTGGCACGCG CAGCGCTCGC GTTGCAATGC
CCGCACCCGG AATTTGCGTC TTTCGCCAGT TTATTAGCCG AAGCGACCAC ATACGAAGAA
TCCCTCAGCG AGGATGAGCG CATCTATATG GCGCTATTGC GTACTGAACT CGCCTTCGCC
CGGTCAGGCT GGCACGCAGC GGCATCGCTG TGGGAACAGT TCGCCGCACG CGCGGCCACG
CTGCCCGAAA CCCTGTGGCG CCGCTTTGCG TCGATGCATC GTCCCCTCTT CGAAGCCGCT
GCACCATACG ATCCGCGCGC CGGTCGCGCG ATTGCGCTGT CTCGCACAGC ATCTCCATCT
TCTGTACGCT GGCGAATAAC GACACTCGGC GGATTCGCGT GTCTGGTCGA TGGGAAACCG
GTCGATCTTT CGCAGTTGCA CCGCGCGCTG CTGGTGCGGC TCCTTGATGC CGGTCCGCAA
GGTCTGGCAG TCGAGCGGCT GTGGGAAGCG GTTTGGGGCG ACGATGTTAT TTCGATGCCG
GCGCTGCATC AGGCGCTGCG TCGGTTACGC CTGCAAACCG GACTCGCCGC TTCGGCGCGC
GAAGGCGCTG TAGCAATCCG CAGCGGTTGG GACGCGATTG AATACGATGT GCGTGAACTG
GAACGCATTC TTGAAACGCC TCCCAGCCTC GAATCCATCC AACGCGCAAT GACGCTCTAC
GGCGGTGAGT TTCTGCCTGG CGCCCCGGCA AGCGCTGCGC TTTGGGTCGA GGCGCGTCGG
GCGCACCTGC AGCAACGCTA CCTTGAGGCT ATTGAACAGT ACGCGCACTC CATCGAACAG
AACTTGCCGC AGCAGGCGAT GTTCTACTAT CAGCACGTGC TTCAGATCGA TGGATGCCGC
GAGCATACCG CCGCCCGGTT GATGCGCCTT GCGGCACGGT ACGGCAACCG CACCCTGGTC
GCCGCCACCT TTGAGCATCT GAAAGGATCG TTACGCGCGC TCGGCGCCTC ACCAGAACCG
GCAACCACTG CACTATACCG GCAACTGACC TGA
 
Protein sequence
MTLRFQQKLV VPTSARPLIE RPNVLAQLDR AIRSKRVVAL AAPAGWGKTT ALAQWVAQRT 
MPVAWYTLDS ADRDPHVFLD YLLHSVADFV PGAPGIAARL AETTPQGLAE MIHQVALAFA
DAPEHFALIL DDVHVLDDDQ AQSIPGVSLV FTLLASIAEY ASHCHLVLAS RTLPVLHGMV
RMVAQQRAAV FDYSVLQFHR DDTQRLAGIT SGLILSDEAA EQLTASVGGW VTGIVLSLDQ
PVAHKDGMTG QQVVDRHLMH VPTPEEAVIE ANTSQVYAYF AEQILAPLPA DLQRFLEDTS
VLHDLSPQRC DILRSADNSA EYLDEIRRRG LFVSSRAGWL SYHSLFRDYL RSRLARDPQR
CRQLLRSAGD LYAAEDDIER ALDCYLAAGD DQHAIDLIRS AVPRLRQCSR QTTLLTCFER
LHRARRTNDQ RTASGGFPPA TRAARKQSIP PDLLLAEARV YSDLALWERA YLALHLAGAI
GDASIRAEAQ ILSAELQVLQ GDYARAQHAL RFVDVNNLDN RLRLEYHVAA ARAHIMAGEV
AAAITELERA HTLVATLIDT VDNPAALADI YDNLGWAYAA QGDRPSAIRY LKRADACWQS
SGNQGRRALT LNNMGVMAME EGRFAEARTT LDLGLEIARQ TELRREETVL LCSLAELDLR
EGDFEQAIQR FTTAHALATR LDIASSVEAA AAGALWAAIL SDNLTLAQAW HDTAASIVTP
LQPEVRGRLA LARAALALQC PHPEFASFAS LLAEATTYEE SLSEDERIYM ALLRTELAFA
RSGWHAAASL WEQFAARAAT LPETLWRRFA SMHRPLFEAA APYDPRAGRA IALSRTASPS
SVRWRITTLG GFACLVDGKP VDLSQLHRAL LVRLLDAGPQ GLAVERLWEA VWGDDVISMP
ALHQALRRLR LQTGLAASAR EGAVAIRSGW DAIEYDVREL ERILETPPSL ESIQRAMTLY
GGEFLPGAPA SAALWVEARR AHLQQRYLEA IEQYAHSIEQ NLPQQAMFYY QHVLQIDGCR
EHTAARLMRL AARYGNRTLV AATFEHLKGS LRALGASPEP ATTALYRQLT