Gene Rfer_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRfer_1048 
Symbol 
ID3963744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodoferax ferrireducens T118 
KingdomBacteria 
Replicon accessionNC_007908 
Strand
Start bp1110315 
End bp1112378 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content61% 
IMG OID637915869 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_522320 
Protein GI89899849 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGCG TCACCTGCGC AGCACCCGCC ATCAATCCCC TGCTGCAGGA CTGGAGCGGC 
CCTTACGGTC TGCCGCCATT TGCCGCGGTA CAGCCTGATC ACTTTTTGCC TGCATTCGAT
GTAGCCCTGG CGGCTCATTT GGCAGAAATC GACGCCATAG CCGGTAACCC CGAGCCAGCC
AGTTTTACCA ACACCATGCA GGCGTTTGAT GAAAGCGGTC GCCTGCGCAG CCGCACCGAA
TTGCTGTTTC ACAACCTCAC TGCCAGCGAA ACCTCGCCTG CACTGCAGGC GCTCGAGCGT
GAGATGGCGC CGCGCCAGGC GGCGCACAAC AATGCCGTCT ACATGAATGC GACCTTGTTC
CAGCGCATTG ATTCACTTTA TGACCAGCGC AATTCATTGG CGTTGGACGC CGAACAACTG
CGCTTGGTCG AACGCGTCCA TCTGGACTTT GTACGTGCAG GGGCGAAACT CGCGCCCGAG
GCTCGACAGC GCTACGGCGC CGTGATGCAG GAGTTGGCTG GCCTGTGCGC ACAGTTTGGT
CAAAACGTAC TGGCCGATGA GGCATCTTTT GTCTTGGAGC TCAAGGACGA GGCTGATTTG
GTGGGCCTGC CAAAGTTTGT ACTGGCCGCA GCTGAGTCGG CTTCCACGCA GCGTGGCCTG
CCAGTGGGCA GCCATGTGAT CACGCTGTCG CCGTCACTGG CCGAGCCCTT CCTCACCTTC
TCAGCCCGGC GAGACTTGCG CGAACTCATC TGGCGCGCCC GGGTGGCGCG CGGGGCGCAT
ACCGGCAGTC ATGACAACCG CCCGGTCGCG GCCCGCATCA TGGCGCTACG CCAGGAGCAG
GCCGCTCTGC ACGGGTACGC CAGTTATGCC GACTATGAGT TGGTCGACCG CATGGCCGGC
CAGCCCAGCG CGGTGATCGA ACTTTTGATG CAAGCCTGGG AACCCGCCAA GGCGCAGGCG
AACGCCGACC GCGAGGCTCT GGTGGCGATG GCCAAGTCTC TGGGAGAGCC GCAAGAGCTG
ACCGCCTGGG ACTGGCGATA CCTGGCGCAG AAGGTACGCG AGCAGCGCTA CAGCCTCGAT
GATGCCGAGG TCAAGCCCTA TTTCGCCCTG GACAACATGA TCAACGCCAT GTTCGACTGC
GCGCATCGCC TGTTTGGCAT CTCCTTTGTG GAACAGCAGG GCCATGCATT GCATCACCCC
GATGCAAGAC TGTGGGAGGT GCGCGGTGGT GGCGATGAGT TGGTCGGTCT GTTCATTGGC
GACAACTACG CCCGTCCCAG CAAGCGCAGT GGTGCCTGGA TGAGCGTTTT TCGCAGCCAG
TCAGGCCATG CGGGTGGCAC CTTGCCGATC GTCATCAACA ACAATAATTT CGCTAAGGCC
GCGCCCACAC TGTTGAGCTT TGCCGATGTG CGCACGCTCT TTCATGAGTT TGGCCATGGC
CTGCACGGCC TGCTATCAAA GGCGAAGTAT GAGCGGCTGG CCGGCACGCG GGTGCTGCGC
GACTATGTGG AGTTTCCATC GCAGTTGTTT GAAAACTGGG CACTAGAAGA TGAGGTGCTG
ACCAGGCACG CCCGCCACTA CGCCACCGGC GAATCCCTTG GCGCTGCCCT GCTGGCCAAG
CTCAAGGCAG CGCGTCACTT TGACCAAGCC TGGCAAACGG TGCAGTACGT CGGCCCGGCG
CTGATTGACA TGGCCCTGCA CTCGCTGCCC AACGGCTCAC CAGTCGACAT TGCACAGTTT
GAGCTGCAGC AGTGCCAGTT GCTCGGTGTA CCCCCCGACA TTGGCCTGCG CCACCATCTT
TCCCATTTTC AGCACCTGTT TTCTGGGGCG AGCTATGCCG CAGGCTATTA CGTTTACATG
TGGGCTGAGG TGCTGGAGGC AGACGGCTTT GATGCGTTCA CGCAGGCCAG TAATCCGTTT
GACCCCGAGA CCGCTGCTCT CTTGCTGCGC CATGTCTACA GCGCGGGCAA CACGCAGGAG
CCAATGGCAG CCTTTCGGGC GTTTCGGGGG CGCGACCCGC AAGTGGCGCC CATGCTCAAG
AAACGTGGCT TGCTCAGCGC CTGA
 
Protein sequence
MTSVTCAAPA INPLLQDWSG PYGLPPFAAV QPDHFLPAFD VALAAHLAEI DAIAGNPEPA 
SFTNTMQAFD ESGRLRSRTE LLFHNLTASE TSPALQALER EMAPRQAAHN NAVYMNATLF
QRIDSLYDQR NSLALDAEQL RLVERVHLDF VRAGAKLAPE ARQRYGAVMQ ELAGLCAQFG
QNVLADEASF VLELKDEADL VGLPKFVLAA AESASTQRGL PVGSHVITLS PSLAEPFLTF
SARRDLRELI WRARVARGAH TGSHDNRPVA ARIMALRQEQ AALHGYASYA DYELVDRMAG
QPSAVIELLM QAWEPAKAQA NADREALVAM AKSLGEPQEL TAWDWRYLAQ KVREQRYSLD
DAEVKPYFAL DNMINAMFDC AHRLFGISFV EQQGHALHHP DARLWEVRGG GDELVGLFIG
DNYARPSKRS GAWMSVFRSQ SGHAGGTLPI VINNNNFAKA APTLLSFADV RTLFHEFGHG
LHGLLSKAKY ERLAGTRVLR DYVEFPSQLF ENWALEDEVL TRHARHYATG ESLGAALLAK
LKAARHFDQA WQTVQYVGPA LIDMALHSLP NGSPVDIAQF ELQQCQLLGV PPDIGLRHHL
SHFQHLFSGA SYAAGYYVYM WAEVLEADGF DAFTQASNPF DPETAALLLR HVYSAGNTQE
PMAAFRAFRG RDPQVAPMLK KRGLLSA