Gene Rru_A3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3041 
Symbol 
ID3836487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3504895 
End bp3506298 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content74% 
IMG OID637827156 
ProductGntR family transcriptional regulator 
Protein accessionYP_428123 
Protein GI83594371 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCCCT CGCCCTGGAT GCCCGTTTTG CCCGAGGGCC GCCGTCCCTT GCATCGGGCG 
ATCGCCGAAG TTCTGGAAGC CGATATCGCC GCCGGGCGGC TGCCGCCGGG CACCCGCCTG
CCGCCCCAGC GCGATCTGGC TTGGCGCCTG AAGGTGACCC TGGCCACCGT CGGCCGCGCC
TATGCCCTGG CCCGCGAGCG CGGATTGATC GGCGGCGAGG TCGGGCGCGG CACCTTCGTG
CTGGGTGAAA CCACCTCGGC CGGCCTCGCC CCCTGGCCGC CCCAGGCCGG AGCCGGCGAG
AACGGCGCCA TCGATCTGGC CAATACCCAC CCGGCGCCGG TGGCCGGTCC GGGCGAGGTC
GCCGCCTGCG CCCGCGCCCT GGTCGAGGCG GGAACCGGCA TCTTCGCCTA TCAGCCCGAC
ACCGCCGCCC CCGACCACCG CGACGCGGCG GCGCGCTGGC TGACCGCCCA GGGCGTGCCG
GCCGGCGCCG AGTCGGTGCT GCTGTCGACC GGCGCCCTGA ACGGGGTTCT GGCGGCGCTG
ATGGCCCTGG CCCGTCCCGG CGATACGGTT CTGACCGAGG CCCTGACCTC GCCGGCCTTC
AAGGGCATGG CCGCCCTGCT TGGCCTGCGC CTGCGCGCCG TCGCCAGCGA TGGCGAGGGG
CTGCGCCCCG AGGCCCTGGA AGCCGCCCTA GAGCGCGATC CCTCGGCCAA ACTGCTGCTC
TGCGTGCCCG CCCTGCACAA CCCGACCGGG GCGACCCTGT CGGCCGAGCG CCGCCTGTCG
CTAACCGCCC TGGCCGAGCG CTTCGATCTG ACGATCATCG AGGATGCGGT CTATGCGCCG
CTGATCGAAA ACGGCCCCCC CTCGCTCAAG GCGCTGATGC CCGACAGGGT GACCCATGTC
ACCAGCCTAT CGAAGATCGG CCTGCCGGGC CTGCGCTGCG GCATGGTGGT GCCGCCCACC
CGCCGGCGCG AGGCGGTGTT GGCCGCCCTA CGCGTGTCGT GCTGGATGGC GCCGCCGCCG
CTGGCCGCCG TCGCCGCGCG CTGGATGGAC GATGGCACGG TGACCCGCCT GATCGCGGCC
CAGCGCCTTG CCGTCAGCGA ACGCCGCGCC CTCGCCGACC GCCTGCTTGG CGATCGCCTG
ACCTGCCGCC CACCGCCGCC CGCTGCCTCG TTCCTGTGGC TGGCCCTGCC GGCGGGCTGG
CGGGGCGACG ATTTCGCCCA TGCCCTGCAT CGGCGCGGCG TGATGGTGAC CCCGGGCGAG
GCCTTCACCG TCGATACCGC CCATGCCCCC CAGGCCGTCC GCCTGTGCCT GGGGCCGCCG
TCGCTCGAGC GTCTGGAAAC CGCCCTGACC GAGGTTCACG CCCTGCTGGG CGAAAGCCCC
GAACGCCTGG GGGTCTTTCT ATAA
 
Protein sequence
MDPSPWMPVL PEGRRPLHRA IAEVLEADIA AGRLPPGTRL PPQRDLAWRL KVTLATVGRA 
YALARERGLI GGEVGRGTFV LGETTSAGLA PWPPQAGAGE NGAIDLANTH PAPVAGPGEV
AACARALVEA GTGIFAYQPD TAAPDHRDAA ARWLTAQGVP AGAESVLLST GALNGVLAAL
MALARPGDTV LTEALTSPAF KGMAALLGLR LRAVASDGEG LRPEALEAAL ERDPSAKLLL
CVPALHNPTG ATLSAERRLS LTALAERFDL TIIEDAVYAP LIENGPPSLK ALMPDRVTHV
TSLSKIGLPG LRCGMVVPPT RRREAVLAAL RVSCWMAPPP LAAVAARWMD DGTVTRLIAA
QRLAVSERRA LADRLLGDRL TCRPPPPAAS FLWLALPAGW RGDDFAHALH RRGVMVTPGE
AFTVDTAHAP QAVRLCLGPP SLERLETALT EVHALLGESP ERLGVFL