Gene Rcas_3677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3677 
Symbol 
ID5541179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4810382 
End bp4812460 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content58% 
IMG OID640895797 
ProductMerR family transcriptional regulator 
Protein accessionYP_001433744 
Protein GI156743615 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.819629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.434964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGAA CGGCGCATGG GTTGACATCG CTCCGCAATG GCGTCAAGAC GTTGCCCTTA 
GCGCCAATGC GCGACAATTC CGAGATTCGG ACCTTGATCG ACCGAGTGCT GCCGGTAGTT
GACGAATGGC GCGAGCATTT GACCGTCAGC ATTGGGCGAG CGACAGAACT GACCGGCCTC
AAGGATACGC AGATTCGCTA TTTTGAAGAA CTCTACCGGC AGGATCGCGA TCCGGCCGCG
TCCTCTGGCG TCACTCGCTC CTACTCGCTG ACCGATCTCC GTCGGTTGGC GGTGTTCGCC
GAACTGCTGA AGGAAGGGTA TCGTCCGGCG CAGGCGGCAG AGATTGTCCG TTCGTTTGCA
GACGCGATCG ATTATGGCGC AGCGTATACT GTCGCTGATG CATTGCGTCG CGAAGGCGAT
GCCATTACCG ATGGCTTCTT GCTGTCGCGC CTGGCAAGCC AGGTGATTTT TGCGGCGCAG
CGAGAGATCG GTGAGCGTCT GCCCGATGCC CGGATTATTG CGATGTTTCT TGTCGAGCGG
CGTATCGACG GCGTCTCAGC AGAAGAGTTT GTAGACATTG CGAGAGATAT TGCAACGCAT
CCCCAGAATG TGCTGGTTGC CGTTGATCGC GCCGCCGTCA TTGGCAACGA GTCCGACAGC
AGCGAGAGTG TGTGGAGTGA TGATGAAAGC AGCATTGTGC TCTTCTATAG CCGCGATCCC
TGGAGTGTTC CTGTGCCGGA TAGGGCGCAA TTCTGCTATT ACCGTCCGCC TTCAACGCCG
GATGCCACGG TTGTCTTTAT GGTCGAGTCT GCCGGGTATC GCAGCATTCC GCCAGAACTC
ACGGTTAAGA CGTCGGCTCG CGATCATCTC CTCGACTGGA TGGTGCGCAT GTGCCTGTCC
ATCTTTCCTG AGTTCCGGCA GGCGACGCGC GCGCACAATT ATCGCTATCG TTCCGACGGC
TATCGCCTGG CGCACACCCG CCAATCGCTA ACCCGGCTGA TGCACCGGGT GCGTGACCTC
ATTTTCGGCG ACGATGCTGA TGTGAGCGTG GCCTGTTTGC TCTTGCCCGA CGATCTCTAC
AAACCGATGT ATCTGTCGAT CCTGGCGCAT GCGGGGTACC CGGAAGCGCG GGCGCGGCAG
GCACGGCTCA TGCTCTACGG CGAAGGTGAA GGATTGAGTG GGCGCGCCTT CAATGCGCGC
GAGCCGTTTC TGACCCTCGA TGCTGAACAG GATCTGCGGG TGTTCGGCGC TCAGGACGAA
CGCTGCAAGG TGGCGCTGGC GGCGCCGCTT TTGTCACACT GGGATACGCA TCCGTTCGGG
GTGCTCTATC TGGCGTCGCA GCGCCCGGCT TCCCCGCTCT GGAGCGAAAC CGCCTTCGTT
GCGCTGGTCT TCGGCAACAT TCTCAGCGAA CTGCTCGGGC GTTGGTGGTT GACGCGCCTG
CGCCGCAAAC ATGATAATCT GCTGCACCGG TATGTGGCGG AGTATATTCG CTGGTTCAAT
GGAATGGATA TGCACGGTCC TGAGTTCAAC GAAGGTTTGC AACAGATCGA GCGGATCTGG
GAACGAATTG CCGTAACATC TGCGAGTTCC GGAAATGTGC GCACTGTCGT GGAACGCCTG
TCGCGCCAGT ATGTGACACT GGTCGTGCTC GACATCGATC GCAGCCGTCA GTTGCACAGT
CGCGGCGATG AACCGTTTCT GCTCGCTGCG CAGCGCCATG TTCACCAGGC AATTGCTCAA
ATCCTGCCCG ATGTGCGGGG GTACTGGTTC AAAAACGATC ATACGCTGCT GGTTCTCGAT
GGTTATGCGC CGGAGGAGGC AATCGTCCTG ATCCGACGGA TTGCCGGCCG GGTGCAGATG
GTGCCGGTTG TGATCGAAGG AAGAACCGAA CGCACGACGG TGACGATCAG CGCGGCTCTT
AAGTCGTTGT CGTATCAGGA ACTCTACGAT CTTTCCCACC ATGATCGCGA GGTGCTGCGG
TCGAGTCTAA TGACGATTAT CGACACTATT TATGAACAGA CGCGAGGGTG TGAACGGACG
ATCAAGGTGT TTCAGCACGG CGTATGGGAA AAAGTGTGA
 
Protein sequence
MDRTAHGLTS LRNGVKTLPL APMRDNSEIR TLIDRVLPVV DEWREHLTVS IGRATELTGL 
KDTQIRYFEE LYRQDRDPAA SSGVTRSYSL TDLRRLAVFA ELLKEGYRPA QAAEIVRSFA
DAIDYGAAYT VADALRREGD AITDGFLLSR LASQVIFAAQ REIGERLPDA RIIAMFLVER
RIDGVSAEEF VDIARDIATH PQNVLVAVDR AAVIGNESDS SESVWSDDES SIVLFYSRDP
WSVPVPDRAQ FCYYRPPSTP DATVVFMVES AGYRSIPPEL TVKTSARDHL LDWMVRMCLS
IFPEFRQATR AHNYRYRSDG YRLAHTRQSL TRLMHRVRDL IFGDDADVSV ACLLLPDDLY
KPMYLSILAH AGYPEARARQ ARLMLYGEGE GLSGRAFNAR EPFLTLDAEQ DLRVFGAQDE
RCKVALAAPL LSHWDTHPFG VLYLASQRPA SPLWSETAFV ALVFGNILSE LLGRWWLTRL
RRKHDNLLHR YVAEYIRWFN GMDMHGPEFN EGLQQIERIW ERIAVTSASS GNVRTVVERL
SRQYVTLVVL DIDRSRQLHS RGDEPFLLAA QRHVHQAIAQ ILPDVRGYWF KNDHTLLVLD
GYAPEEAIVL IRRIAGRVQM VPVVIEGRTE RTTVTISAAL KSLSYQELYD LSHHDREVLR
SSLMTIIDTI YEQTRGCERT IKVFQHGVWE KV