Gene Rfer_3822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRfer_3822 
Symbol 
ID3960077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodoferax ferrireducens T118 
KingdomBacteria 
Replicon accessionNC_007908 
Strand
Start bp4265158 
End bp4266663 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content62% 
IMG OID637918647 
Productpeptidase S1C, Do 
Protein accessionYP_525052 
Protein GI89902581 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCT CTCTCTTTTC CCCGCGGCGC TTGGTGCTGG CACTCGTTTC AGTTGGCGTC 
CTGGGTGCGG CGGGCGTTGG TGCCTTGAGT TTTTCACACG CGCAGGCTGC CGTTTCCACA
GCACCCACTG CAAGCGCAGC AGCGACGGCC CCCGTTGTGG CCCTGCCGGA TTTCTCCCAG
ATTGCGGCAC GTAACGGCGC GGCGGTGGTC AACATCAGCG TCACCGGGAT GATCAATACC
TCGGGTGAAG ACAACGGCGA TGGCGCACCA CACGGCGCCA GGCCGGGTAT GCCCGGCATG
GATCCAAACG ATCCGTTCTT TGAATTCTTC AAGCGCTTTC AAGGTCCCAA CGGTGGCTTT
CCGGGCCAGC CCCGGATGCC GATGCATGGC CAGGGCTCGG GATTCATCAT CAGCCCCGAT
GGCGTGATAC TGACCAACGC CCATGTGGTG CGTGACGCCA AAGATGTCAC CGTCAAACTC
ACCGACCGCC GCGAATTCCG CGCCAAAGTG CTCGGCACTG ACCTGAAGAC CGATGTCGCG
GTACTCAAGA TCGATGCCAA AGACCTGCCA ACCATCACGG TGGGCACTAC GCGCGACCTG
AAAGTGGGCG AGTGGGTACT TGCCATCGGA TCGCCGTTTG GCTTCGAGAA CAGCGTGACC
GCAGGGGTGG TGAGCGCCAA AGGCCGCTCA CTGCCGGACG ACAGTTTCGT GCCCTTCATT
CAGACTGACG TGGCCGTCAA TCCCGGCAAC TCAGGCGGGC CGCTGTTCAA TACCCGCGGC
CAGGTGGTTG GCATCAACTC GCAGATTTAC AGTCAAAGCG GCGGCTATCA GGGCCTGTCG
TTTGCCATTC CGATCGAATT GGCAAGCAAG GTCAAGGATC AGATAGTCGC CACCGGACAT
GCCAGCCACG CGCGCCTGGG TGTCGTGATC CAGGAGGTGA ACCAGACTTT TGCCGATTCC
TTCCATCTCG ACAAACCCGA AGGCGCGCTG GTGTCCAACG TTGACAAGGA CGGCCCGGCG
GACAAGGCGG GGCTCAAGAG CGGTGACGTG ATCCTGAAAG TCAATGGCCA ACCGATCATC
ATGTCCAGCG ATCTGCCGGC CCTGATCGGC ACGGCAGCGC CGGGCGATAA GGTCAGTCTT
GAAATCTGGC GTCAGGGCAA GCGCGAACAA TTCACCGCCA GGCTTGGCGA CGCCAGTGCC
AAGGTGGAGC AGCTGGCCAA GGCAGACGAC GGCGTCGGCC AAGGCAAGCT CGGGCTGGCA
CTGCGTGCCT TGCAGCCGCA GGAGAAACGC GCCGCCGGCG TGAACCAGGG TTTGCTGGTC
GAGGATGCAG CTGGCCCTGC CGCCCTGGCG GGCGTGCAAG CGGGTGACGT ACTGGTGGCG
GTGAACGGCA CGCCGATTGA AAGTGTGACC CAAGTTCGCG CCATAGTGGC CAAGGCCACC
AAATCGGTGG CGCTGCTGAT TCAGCGTGAC GGCAGCAAGA TCTTCGTGCC GGTGCGCCTG
GGTTGA
 
Protein sequence
MNTSLFSPRR LVLALVSVGV LGAAGVGALS FSHAQAAVST APTASAAATA PVVALPDFSQ 
IAARNGAAVV NISVTGMINT SGEDNGDGAP HGARPGMPGM DPNDPFFEFF KRFQGPNGGF
PGQPRMPMHG QGSGFIISPD GVILTNAHVV RDAKDVTVKL TDRREFRAKV LGTDLKTDVA
VLKIDAKDLP TITVGTTRDL KVGEWVLAIG SPFGFENSVT AGVVSAKGRS LPDDSFVPFI
QTDVAVNPGN SGGPLFNTRG QVVGINSQIY SQSGGYQGLS FAIPIELASK VKDQIVATGH
ASHARLGVVI QEVNQTFADS FHLDKPEGAL VSNVDKDGPA DKAGLKSGDV ILKVNGQPII
MSSDLPALIG TAAPGDKVSL EIWRQGKREQ FTARLGDASA KVEQLAKADD GVGQGKLGLA
LRALQPQEKR AAGVNQGLLV EDAAGPAALA GVQAGDVLVA VNGTPIESVT QVRAIVAKAT
KSVALLIQRD GSKIFVPVRL G