Gene Rcas_3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3001 
Symbol 
ID5540497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3894042 
End bp3895193 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content63% 
IMG OID640895123 
ProductGntR family transcriptional regulator 
Protein accessionYP_001433076 
Protein GI156742947 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.19098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCCTG TCGTTCAAAT CGTTCATCGT CCCGGCATCC TCGATTTGGG GTGGGGGCAT 
CCCGATCCTG CCGTGTTACC GGTCGCATCC ATACAACGTG CCGCAACAAC CGTGCTCGCG
CGCTACGGCG CCGATGCCCT GGCGTATGGC GCCGAGCGCG GACCAGGACC GCTGATCGCG
TGGATCTGCT CTCGTCTGAA GCGCACCGAT GCGCGCGCGC CTGATCCGGT TGAGGTGGTG
ATCACCGCTG GCGCCTCGCA GACGCTCGAT CTCCTCTGCA CACTCTGCAC GCATCCCGGC
GATGTCGTGC TGGTCGAGTC GCCGACGTAT CATCTGGCGG TGCGTATCCT GCGCGATCAT
CCGCTGAATC TGGTCGCCGT CCCTTCCGAT GCCGACGGCA TTGATGTTGA TGCGCTGAAA
AGTCTTCTGG AGCGGATGGC GCGGCGCGGC AGAACGGCGC GTCTGCTCTA CTTCGTTCCC
ACGCACCACA ACCCGACCGG CGTATGCATG AGCCTTGAAC GTCGCAGAGC GCTGGCGGCA
GTCGCCGCAG CGTATGGGGT GCTCCTGGTG GAAGACGACG TGTACCGTGA ACTGAGTTAC
GATGCTCCTG CGCCGCCGTC AGTGTGGAGT CTGGCGCCTC CGGGCATCGT GGCGCGCATC
GGCTCCTTCT CAAAATCGCT GGCGCCAGGG TTGCGCCTGG GCTATCTGAC TGCTGATGCG
TCATTGACCC GGCGATTGAT CGGCAGCGGA TTGCTCGACA GCGGCGGTGG TCTCAACCCT
TTTGTGGCGC TGACGGTTGC CGAAGTCTGC ACATCGGGCG ATTTCGATGC AACAATTGCG
CGCCTTTGCG CCACTTACCG TGAGCGACGT GATGCGCTGG CAGGTGGTCT GCGTGATTAC
CTGCCGCCGG GATGCCATTG TACCGTGCCC GGCGGCGGTT TCTTTCAATG GGTGGCATTG
CCGGAAGGGA TCGATGCCGG GGAATTGTTG CCGTTCGCCG AATCCATGGG GGTCTCATAC
CTGCCCGGAT CACGGTTTTA TCTCGACGCG CCACGACGTA ACACGCTCCG GTTGGCATTC
AGCCTGTACC AACCGCCTCA ACTCATCGAA GCGGCGCGGC GTCTGGGAGA AGCCATTGCG
ACCGTCGTCT GA
 
Protein sequence
MLPVVQIVHR PGILDLGWGH PDPAVLPVAS IQRAATTVLA RYGADALAYG AERGPGPLIA 
WICSRLKRTD ARAPDPVEVV ITAGASQTLD LLCTLCTHPG DVVLVESPTY HLAVRILRDH
PLNLVAVPSD ADGIDVDALK SLLERMARRG RTARLLYFVP THHNPTGVCM SLERRRALAA
VAAAYGVLLV EDDVYRELSY DAPAPPSVWS LAPPGIVARI GSFSKSLAPG LRLGYLTADA
SLTRRLIGSG LLDSGGGLNP FVALTVAEVC TSGDFDATIA RLCATYRERR DALAGGLRDY
LPPGCHCTVP GGGFFQWVAL PEGIDAGELL PFAESMGVSY LPGSRFYLDA PRRNTLRLAF
SLYQPPQLIE AARRLGEAIA TVV