Gene Rfer_3000 details


Gene Information

Locus tag: Rfer_3000
Symbol:
ID: 3960345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodoferax ferrireducens T118
Kingdom: Bacteria
Replicon accession: NC_007908
Strand:
Start bp: 3307705
End bp: 3309252
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 67%
IMG OID: 637917820
Product: AraC family transcriptional regulator
Protein accession: YP_524242
Protein GI: 89901771
COG category: [F] Nucleotide transport and metabolism; [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase; [COG2169] Adenosine deaminase
TIGRFAM ID:
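
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, so both are assumptions, and the function names are illustrative.

```python
# Values copied from the record above.
START_BP = 3307705
END_BP = 3309252
GENE_LENGTH_BP = 1548
PROTEIN_LENGTH_AA = 515


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming one terminal stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # prints "1548 515"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.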


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCCC ATCACGCACA GCCCGACGAC GACGCCTGCT ACCTCGCGCT CAGGGCGAAG 
GACGCACGTT TTGACGGCTG CTTTTACACC GGCGTTACAT CTACCGGCAT TTATTGCCGT
CCGGTGTGCC GGGTACGCAC CCCCAAGCGT GAAAACTGCC GGTTTTTTGT TCACGCGGCG
CAGGCGGAAC AGGCCGGTTT TCGGCCCTGC CTGCGCTGCC GCCCGGAACT CGCACCGCGC
ACCTCTGCCC TCGGCCACGT GTTTGAAACC AAGCCGTGGT CCATCCAGGA CGCCTCCAGC
ATCTTGGCCA CTCAGGCGGC CCGCCTGCTG GACACACCGG AAGCCTGGAC CGAGAGCACA
CCGAGCGTGC AGCGGCTGGC CCAGCGCCTG GGCGTGAGCG ACCGCCACCT ACGCCGCATT
TTTGAAGCGC AGTTCGGCGT GTCGCCCCTG CAGTACCTGC AAACACAGCG CTTGCTGAGC
GCCAAACAGT TGCTGACCGA CACCGCACTG GCCGTCACGC AGGTCGCGCA CTTGAGTGGC
TTCACCAGTG TGCGGCGCTT CAATGCCGTG TTTGCCGCGC ACTACGCCCT CAGCCCGACG
CAGTTGCGCC GCAACGGCGC AAAACACGGG CACGATGCAC CCGGCCAGAG CCTTCAGGTC
CGGCTGGCCT ACCGCCCGCC CTATGACGTG GACGCCATGC TGCAGTTCTT TGGCAAACGC
CAGGTCACGG GGATTGAATG CGTCACGCTG GAGCCCGGCA ACCAGAGCAT CGGCAAAACC
GTGGCCGTGC AATGCGGCCA GACGCTCTAC ACCGGCTGGC TGGTGGCCCG CTTTGAGGCT
GCTGGCAACC GTGTGGATCT GCGCTTGAGT GATTCGCTGC GCGAGGTGCT GCCGTGGGTA
ATGGCGCGCG TGCGGGCCAT GCTCGATCTG GATGCGGATC CCCGCGCCAT CAGCAGCCTG
TTGCGCACCA GCTTCCCACG GGGCGACGGT TTGCGCGTGC CCGGCACCCT GGACGGCTTT
GAGCTGGCCG TGCGCGCCGT CCTGGGCCAA CAGATCACCG TGGCTGCGGC GCGCACCCTG
GCGGCAAGAC TGGTACAACG GTTTGGTGAA CCGATCGAAA CCCCGATCGC CGGGCTGAAC
CGGCTTTTTC CAAGCGCCGC GGCACTGGCC GCCGCCAGCG GCGACGCACT GGGCCAACTG
GGCATCGTCC GGCAGCGGCA AGCGGCCATT CAGGCCATCG CCAGCGCCGT GGCGGCGCAG
CGCCTGCCAC TGCACGCCGG GGCGGACGTG CCCTCCACCC TGGCCGCCCT GAAGGCTTTG
CCCGGCATTG GCGACTGGAC CGCCCAGTAC ATCGCCATGC GCGCACTGCG CTGGCCGGAC
GCCTTTCCGG CCGGCGACGT CGCCCTGCAC AAGGCACTGG GCGTGCAAGA CGCCAAAAAC
CCGGCCAAAG AAGCGCAGGC CGCGTCACTG GCCTGGCAAC CGTGGCGCAG CTACGCCGTG
ATCCGAGCCT GGGCCGCCCC ACCGGTTGCC GCAGATGCTA CAAAATAG
 
Protein sequence
MTPHHAQPDD DACYLALRAK DARFDGCFYT GVTSTGIYCR PVCRVRTPKR ENCRFFVHAA 
QAEQAGFRPC LRCRPELAPR TSALGHVFET KPWSIQDASS ILATQAARLL DTPEAWTEST
PSVQRLAQRL GVSDRHLRRI FEAQFGVSPL QYLQTQRLLS AKQLLTDTAL AVTQVAHLSG
FTSVRRFNAV FAAHYALSPT QLRRNGAKHG HDAPGQSLQV RLAYRPPYDV DAMLQFFGKR
QVTGIECVTL EPGNQSIGKT VAVQCGQTLY TGWLVARFEA AGNRVDLRLS DSLREVLPWV
MARVRAMLDL DADPRAISSL LRTSFPRGDG LRVPGTLDGF ELAVRAVLGQ QITVAAARTL
AARLVQRFGE PIETPIAGLN RLFPSAAALA AASGDALGQL GIVRQRQAAI QAIASAVAAQ
RLPLHAGADV PSTLAALKAL PGIGDWTAQY IAMRALRWPD AFPAGDVALH KALGVQDAKN
PAKEAQAASL AWQPWRSYAV IRAWAAPPVA ADATK
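
The gene and protein sequences above are printed in blocks of ten characters. As a rough sketch, the blocks can be joined and the gene translated to check that the two sequences correspond. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for amino-acid assignments; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGACACCCC ATCACGCACA GCCCGACGAC GACGCCTGCT ACCTCGCGCT CAGGGCGAAG"
LAST_LINE = "ATCCGAGCCT GGGCCGCCCC ACCGGTTGCC GCAGATGCTA CAAAATAG"

print(translate(clean_sequence(FIRST_LINE)))  # prints "MTPHHAQPDDDACYLALRAK"
print(translate(clean_sequence(LAST_LINE)))   # prints "IRAWAAPPVAADATK"
```

The two printed fragments match the start and end of the protein sequence above. Passing the full cleaned gene sequence to `gc_content` gives a value to compare with the GC content field.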