Gene RoseRS_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2027 
Symbol 
ID5208989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2513557 
End bp2514708 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content62% 
IMG OID640595633 
ProductGntR family transcriptional regulator 
Protein accessionYP_001276362 
Protein GI148656157 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0905518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCCTA TCGTTCAACT CGTTCATCGA CCCGGCATTC TCGATCTTGG TTGGGGGCAC 
CCCGACCCCG CGGCATTACC GGTGGCAGCG CTCCGGCGCG CGACGGATGC GACGCTGACG
CGCTACGGCG CCGATGCGCT GGCGTATGGC GCAGAGCGTG GACCTGGACC GCTGATTGAA
TGGATCTGCG CGCGCCTGGC GCACATCGAT GCACGCCATC CCGACCCCAC AGAGGTGCTG
ATTACATCGG GCGCATCGCA GGCGCTCGAT CTCCTCTGTA CGCTCATCGC TGCACCCGGC
GATACGGTGC TGGTCGAATC GCCGACGTAC CACCTGGCGG TGCGCATTCT GCGCGACCAT
CCATTACACC TGGTCGCCGT TCCATCCGAT GCCGACGGTA TCGATGTGGA AGCGCTGACG
ATCATCCTGA GGCAACTGGC GAAGCGCGGC AGACAGGCGC GTATGCTCTA TTTCGTTCCC
ACCTATCACA ACCCAACCGG CGTTTGCCTG AGCCTGGAAC GCCGTAGGGC GCTGGCAATG
ATTGCCGCCG AGCATGGGTT CGTTCTGGTC GAGGACGATG TGTACCGCGA ACTGAGTTAC
GATGCGCCTG CGCCGCCGTC GGTGTGGAGC ATCGCACCAC CGGGTGCAGT GGTGCGGATC
GCATCGTTCT CGAAATCGCT GGCGCCAGGA CTCCGCCTTG GTTACCTGAC CGCCGATGCA
TCGTTGACCA GACGGTTAAT CGGCAGCGGC TTACTGGACA GTGGCGGAGG AGTCAATCCA
TTCACAGCGC TCACCGTCGC CGAAGTGTGC GCTACGGGTG ATTTTGAGGC GACAGTAACA
CAGTTGCGTG CGATGTATCG GGAGCGACGC GATGCACTGG CGCAGAGCCT GCGTATGTAT
CTGCCACCCG GATGCCGATT CACCGTGCCG GGCGGCGGAT TCTTTCAGTG GGTGGAATTG
CCGGAAGGGG TCGATGCGGC AACCCTGCTG CCACGCGCTG AACAGACAGG CGTCTCCTAT
CTTCCCGGAT CACGCTTCTA TCTCGATGCA GCGCGATCCA ACACACTGCG TCTCTCATTC
AGCCTGTATC CGCCGCACCA ACTGACCGAA GCAGCGCGAC GATTGGGAGA AGCGCTTGCA
GCGATCAGGT GA
 
Protein sequence
MLPIVQLVHR PGILDLGWGH PDPAALPVAA LRRATDATLT RYGADALAYG AERGPGPLIE 
WICARLAHID ARHPDPTEVL ITSGASQALD LLCTLIAAPG DTVLVESPTY HLAVRILRDH
PLHLVAVPSD ADGIDVEALT IILRQLAKRG RQARMLYFVP TYHNPTGVCL SLERRRALAM
IAAEHGFVLV EDDVYRELSY DAPAPPSVWS IAPPGAVVRI ASFSKSLAPG LRLGYLTADA
SLTRRLIGSG LLDSGGGVNP FTALTVAEVC ATGDFEATVT QLRAMYRERR DALAQSLRMY
LPPGCRFTVP GGGFFQWVEL PEGVDAATLL PRAEQTGVSY LPGSRFYLDA ARSNTLRLSF
SLYPPHQLTE AARRLGEALA AIR