Gene Rfer_4029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRfer_4029 
Symbol 
ID3961708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodoferax ferrireducens T118 
KingdomBacteria 
Replicon accessionNC_007908 
Strand
Start bp4492068 
End bp4493183 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content64% 
IMG OID637918853 
Productputative DNA alkylation repair enzyme 
Protein accessionYP_525258 
Protein GI89902787 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAG CCCTCAAAAA CCAGTTTGGC GCCGATGTGC CGCGTGCGAT TGCCGCCATG 
ATCTCGGCGG TGCACCCGTC TTTCAACCGC ACGGCTTTTG TGAGCGACGT GCTGGACGGG
TACGACGCGC TGGCGCTGAT GCCGCGGGGA AAAAAAATAG CCCAGGCCTT GCGTCGCCAT
TTGCCAGACG ACTATGCGCA CGCGCTTGCC ATCCTGCTGG ACTCGCTGGA TCAAGCCCAT
GGCCGCGACC CCGGCCAGAG CCTGGCCTCG TTTCTGTACC TGCCGCACAC CCAGTTTGTG
GCTGAGTTCG GCCTGGCCCA CTTCGAGCTG TCGATGCGGG CACAACATGC CTTGACGCAA
CGCTTCACGG CCGAGTTCAG CATCCGCCCG TTCATCGAAC ACCACCCTGA AGCCACCTTG
CGTCAGCTTC AGGCATGGGC ATGCGACCCC AGTGCACACG TTCGCCGGCT GGTGTCCGAA
GGCACGCGCC CCCGGCTGCC CTGGGCACCG CGTCTGCGCC GGTTTCAGGC CGACCCGGCG
CCGGTATTGG CGCTGCTGGA GCTGCTCAAG GATGACCCCG AGTTGTATGT GCGGCGTTCG
GTCGCCAACA ACCTGAACGA CATTGGCAAG GACCACCCGG ACGTTCTGGT CCACACGGCC
CAGGCCTGGC TCCAGGGTGC CAGTGCGCAG CGCGCATGGA TCGTTGGCCA TGCCTTGAGG
TCCGCCGTCA AACGGGGCGA AAGCGGCGCG CTGCAGGTGC TGGGGTTTGG CCAGACGCCC
CGCGTGAGCG TGACCAAGGT CCAGATCAGC CCCCGCCTTG CCGTGACCGG TGGCACCGTG
CAGATTGAGT TTGACGTGAC CAACTGCCAC ACCTCAGCAC AAAGCGTGTT GGTGGACTTT
TGCGTGCATT ACGTCAAGGC CAACGGCCAG ACCCGCGCCA AGGTGTTCAA GCTCAAAACC
CTGCAACGGG CACCCGGCCA GACCGCGCCG CTGGCCAAAA AACTGTCGCT GGCGCAGATG
AGCACCCGCA GACACTACCC GGGGCTCCAC AAGCTGGACG TGATGCTGAA TGGCCAAGCC
CAGCCGCTGG GCGCATTCGA GTTGCTGCAA GCCTGA
 
Protein sequence
MAEALKNQFG ADVPRAIAAM ISAVHPSFNR TAFVSDVLDG YDALALMPRG KKIAQALRRH 
LPDDYAHALA ILLDSLDQAH GRDPGQSLAS FLYLPHTQFV AEFGLAHFEL SMRAQHALTQ
RFTAEFSIRP FIEHHPEATL RQLQAWACDP SAHVRRLVSE GTRPRLPWAP RLRRFQADPA
PVLALLELLK DDPELYVRRS VANNLNDIGK DHPDVLVHTA QAWLQGASAQ RAWIVGHALR
SAVKRGESGA LQVLGFGQTP RVSVTKVQIS PRLAVTGGTV QIEFDVTNCH TSAQSVLVDF
CVHYVKANGQ TRAKVFKLKT LQRAPGQTAP LAKKLSLAQM STRRHYPGLH KLDVMLNGQA
QPLGAFELLQ A