Gene Rcas_1567 details

Gene Information

Locus tag: Rcas_1567
Symbol:
ID: 5539043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2011958
End bp: 2013151
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 62%
IMG OID: 640893705
Product: GntR family transcriptional regulator
Protein accession: YP_001431678
Protein GI: 156741549
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
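
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record; function names are illustrative, not part of any database API.
START_BP = 2011958
END_BP = 2013151


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1194
print(protein_length(length_bp))  # 397
```

Under these assumptions the computed values agree with the reported Gene Length (1194 bp) and Protein Length (397 aa).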


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCTG TTTCCGCGCC CGCCCTGGGT AACCTGTACG CTGCGCGGGC GAAGAATCTC 
GCCCCGGCTC AAATCTGGCC CGAACACGAG GGCGATCTGA TCTCGCTGGC ATACGGTTTT
GCTGCGCCGG AACTGTTCCC AACTGACGAT CTCCTCTCCG CCACTGCCGA AGTGTTGGCG
GAGGATGCGG CGGAAGGGTT GAACTACGCG CCAACCTACC CCGGTCTGGT GCAGTTCGTT
GCGAATCGCC TGCGCGCTCA GGGAACCCCG GCTGAACCGG GCACTGTGCT GATTTCCTAC
GGTTCCAGCC AGGTGCTGGC GTTGCTGCCG CAGGTGTTCA TCGATCCCGG CGATGCCGTG
ATCGTCGAAG GACCGACGTT TATGGGTGCG GTGCGCCATT TTGCGCTGGC AGGCGCGCGC
CTGATCACCG TCGATGTCGA TGAGTACGGC ATGAATGTTG ATGCTCTTGA AGAGACGTTG
CGCGATCTGG CGCAGCGCGG TCAACGCCCC AAGTTCATCT ATACGATCCC GACCTTTCAT
AATCCGAGCG GCGTGCTGAT GCCGCTGGAG CGCCGACAGC GACTGGTGGC GCTGGCGAAG
GAGTATGGGG TGCTGATCGT TGAAGACGAC GCCTACGGCG ATCTCTACTT CGAGAACCCG
CCGCCGCCGC GTCTCTCTGC GCTCGACCAT GAAGGGTGGG TGTTGCAGGT CGGCACCTTC
TCGAAAATCC TGGCGCCGGG GTTGCGTATG GGGTGGGCAT GCGGCAATCA CGAGATCATT
CAGCGCCTGG CAAGTTTCAA GGTCGAAGGG TCGAGTGGTC CCTTCCTGAC GCGCATGGTC
GAGCGGTGCT GCGCCGATGG CCGCCTGGAG CGGCATATCG CCGAACTGCG CGCGGCGTAT
CGGGCGCGAC GCGACCTGAT GCTGACAGTG ATTGCGCGTG AATGGACTCC GGAGGTGCGC
GTCACAAAAC CTGAGGGTGG GTTCTTTATC TGGGCGCGGT TGCCGCAGGG TGTGAGCGCA
ACCGCGTTGC TCGCCGAAGC GGAGAAGCAT GGGGTGACGT TTTCGCCAGG AACGCATTTC
TATGCAAACG GTCAGGGAGA TGATGCGTTC CGCCTGTCGT TCAGTTTTGT TCCGCATCAG
CAGATCGAGG ATGGCATCGC GCGCATCGGT GCTGCATTGC GGATGTTCCA GTAA
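
A GC content figure such as the one reported in this record can be computed from a sequence laid out in space-separated blocks of ten bases, as above. The following is a minimal sketch; `gc_percent` is a hypothetical helper, and the short string used here is only the first block of the gene sequence, not the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten bases of the gene sequence, used as a small worked example.
first_block = "ATGTCTGCTG"
print(gc_percent(first_block))  # 50.0
```

Passing the complete gene sequence to the same function gives the value to compare against the reported GC content.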
 
Protein sequence
MSAVSAPALG NLYAARAKNL APAQIWPEHE GDLISLAYGF AAPELFPTDD LLSATAEVLA 
EDAAEGLNYA PTYPGLVQFV ANRLRAQGTP AEPGTVLISY GSSQVLALLP QVFIDPGDAV
IVEGPTFMGA VRHFALAGAR LITVDVDEYG MNVDALEETL RDLAQRGQRP KFIYTIPTFH
NPSGVLMPLE RRQRLVALAK EYGVLIVEDD AYGDLYFENP PPPRLSALDH EGWVLQVGTF
SKILAPGLRM GWACGNHEII QRLASFKVEG SSGPFLTRMV ERCCADGRLE RHIAELRAAY
RARRDLMLTV IAREWTPEVR VTKPEGGFFI WARLPQGVSA TALLAEAEKH GVTFSPGTHF
YANGQGDDAF RLSFSFVPHQ QIEDGIARIG AALRMFQ
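
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables are understood to differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first line of each sequence is used as a spot check.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence from this record.
GENE_START = "ATGTCTGCTG TTTCCGCGCC CGCCCTGGGT AACCTGTACG CTGCGCGGGC GAAGAATCTC"
PROTEIN_START = "MSAVSAPALG NLYAARAKNL"

print(translate(GENE_START))  # MSAVSAPALGNLYAARAKNL
print(translate(GENE_START) == "".join(PROTEIN_START.split()))  # True
```

The same function applied to the full gene sequence should stop at the terminal TAA codon, which accounts for the one-codon difference between 1194 / 3 = 398 codons and the 397 aa protein.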