Gene Rru_A1143 details


Gene Information

Locus tag: Rru_A1143
Symbol:
ID: 3834653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1349175
End bp: 1350665
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 65%
IMG OID: 637825232
Product: GntR family transcriptional regulator
Protein accession: YP_426231
Protein GI: 83592479
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
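
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1349175, 1350665)
print(length_bp)                  # 1491
print(protein_length(length_bp))  # 496
```

Both results agree with the Gene Length and Protein Length fields of the record.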


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTCAAGC ATTCCCAACT GGATTCCGTA AAGGCATGGA TTGCCCATCC CGCCCATGCG 
GTGATGCCGC TTCACGCGCG GATTCAGCGG GCCATTCGGC AATTGATCGT CGACGGCGCG
CTGGGAGCGG GCAAGCCGCT GCCCGCCTCG CGCGCCCTCG CCACATCGCT TGGGGTATCG
CGCGACACGA TCGAAGCGGC CTATTCCCAA CTTCATGCCG AGGGGTTCAT CGACCGGCGC
GTGGGAAGCG GCAGCTTCGT GGCGGAAATG ACCGAGTTCA TGCCCAGCCG TCCCCTTTCC
CAGCGGGATG CGCTCTTGCG CAACCAAGCT CCCACTCTCA GCACCCGGGG GGCGGCCATG
TTCCGCAGTG GCGGCGTTCG CGAGATGCTC GCCCCACGGC CTTTCGCTCA TGGGGTGCCG
GAAACCCGAA CCTTTCCCCT CCAACTCTGG GAACGTCTGG AACGGCAGGT GCGCAAGGAG
GTCGGCGCGC AAACCCTGTT TCATGGCGAC CCGCAGGGGA CCGAGGCGCT TCGCCGCGCC
ATCGCCGACT ATGTGAACCT GGAACGCGGC GCCCGCGCCA CGGCCGACCG CGTGCTGGTG
CTGACCAGTT CGCAACAGGC GATGTCGCTA TGCGCGACCA TGCTGCTTGA TCCCGGCGAC
CGGATCTTCC TTGAAGACCC CGCCTATTAC GGGGCGCGCA AGGCGTTCGA TGCGGCGGGG
CTGGACTGCG TTCCGATCCC TGTCGACCGG CAAGGTATCG TGGTCGATCA AATCATGGCC
GAACCGCACG GGGCCAAGGC GGTTTTCCTG ACGCCATCCC ACCAGTTTCC GACCGGCGCG
ACGCTGGCGC TGGACCGCCG TCTGGCGCTG ATCGAATGGG CGGCGCGGAC TCAGGCGTGG
ATCATCGAAG ACGATTACGA CAGCGAGTTC CACTACGCGG GCAAGCCGAC GGCCTGCGTG
CAAGGTCTCG ATAAGCATGA CCGCACCCTC TACATCGGCA CCTTCACCAA ATCGCTCTTT
CCGGGCTTGC GGATCGGCTA TGTCGTCTTG CCCCCGCCGC TGGTGAAGCC GATGACCGTC
GCCCGCACCT TGCTTGATGG CCATACGGCC CCCATGGCGC AACTGACGCT GGCCCGCTTT
ATGGAAGGGG GGCATTTCGG GGCGTATGTT CGCACCATGC GCGGCGTCTA TGCCGAGCGG
CTCGCCCTCC TGGCCGGCCT TGTCAGCAAG CATCTGGCGG ATTTCGTCGA GCCGCGCGTT
CCCATTGGCG GGCTGCAACT GCCCTGTCTG TTGACCCGCG ACCTCTCGGA ACGCACCGCC
ATCGACGCGG CGCGACGGGT CGGGATCGAG TTGCTCGGCT TGTCGGCGTT GCACGCCGCC
GGCGATGGCA AAGCCGGCTT CCTGATGGGC TTTGCCGCCT ATACGCCCCT CGAAATCGAG
GAGGCCGTGA GGAAACTGGA AAAGGCGCTG CGGGCGGTGA CAAGGCCATA G
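
A GC content figure such as the one reported for this gene is normally the percentage of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted in as text with the spaces and line breaks of the listing above. The name `gc_percent` is hypothetical, and the exact rounding used by the source database is not known.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so a sequence can be
    pasted in with the 10-base grouping used in the listing.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence only; the whole-gene value requires
# passing all rows of the listing.
first_row = "TTGTTCAAGC ATTCCCAACT GGATTCCGTA AAGGCATGGA TTGCCCATCC CGCCCATGCG"
print(round(gc_percent(first_row), 1))
```

A single row will not necessarily match the whole-gene figure, since base composition varies along the sequence.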
 
Protein sequence
MFKHSQLDSV KAWIAHPAHA VMPLHARIQR AIRQLIVDGA LGAGKPLPAS RALATSLGVS 
RDTIEAAYSQ LHAEGFIDRR VGSGSFVAEM TEFMPSRPLS QRDALLRNQA PTLSTRGAAM
FRSGGVREML APRPFAHGVP ETRTFPLQLW ERLERQVRKE VGAQTLFHGD PQGTEALRRA
IADYVNLERG ARATADRVLV LTSSQQAMSL CATMLLDPGD RIFLEDPAYY GARKAFDAAG
LDCVPIPVDR QGIVVDQIMA EPHGAKAVFL TPSHQFPTGA TLALDRRLAL IEWAARTQAW
IIEDDYDSEF HYAGKPTACV QGLDKHDRTL YIGTFTKSLF PGLRIGYVVL PPPLVKPMTV
ARTLLDGHTA PMAQLTLARF MEGGHFGAYV RTMRGVYAER LALLAGLVSK HLADFVEPRV
PIGGLQLPCL LTRDLSERTA IDAARRVGIE LLGLSALHAA GDGKAGFLMG FAAYTPLEIE
EAVRKLEKAL RAVTRP
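
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates the idea with the standard codon assignments. The set of start codons is a deliberately simplified assumption (table 11 allows further alternatives), and `translate_cds` is a hypothetical helper, not a library function.

```python
BASES = "TCAG"
# Standard codon assignments in T, C, A, G order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTTCAAGC ATTCCCAACT GGATTCCGTA"))  # MFKHSQLDSV
```

The output matches the first ten residues of the protein sequence listed above.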