Gene Rfer_1114 details


Gene Information

Locus tag: Rfer_1114
Symbol:
ID: 3964352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodoferax ferrireducens T118
Kingdom: Bacteria
Replicon accession: NC_007908
Strand:
Start bp: 1196401
End bp: 1197909
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 56%
IMG OID: 637915935
Product: tryptophan halogenase
Protein accession: YP_522386
Protein GI: 89899915
COG category: [R] General function prediction only
COG ID: [COG0579] Predicted dehydrogenase
TIGRFAM ID:
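
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1196401, 1197909)
print(length, protein_length(length))  # prints: 1509 502
```

Under these assumptions the computed values agree with the listed 1509 bp and 502 aa.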


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.650107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCGA ATATGCTCAG GAGTATCGTC ATCGTCGGTG GTGGCACCGC AGGTTGGATG 
ACGGCCGCTG CGCTGTCCAA TGTATTGGGT GATCACTATC ACATTCGCCT GATCGAGTCC
GACGAGATCG CCACCATCGG CGTGGGTGAG GCGACTATTC CGCTGATCAA AGACTTCAAT
CTGGCGCTGG GCATTGACGA AAACGAATTC ATGCGCCAGA CGCAGGGGAC TTACAAACTC
GGCATCGAGT TTGTCAACTG GGGCAAGATC GGCGACTCCT ATATTCACGG TTTCGGCAAG
ATCGGTCAGG ACCTCGGCCC CATTGCGTGT TACCAATATT GGCTCAAGAT GCATCAGGCC
GGTGAAGCAT CGGATCTGGG AAACTATTCA ATCAATACCC TGGCACCCAG AAAGTCGAAA
TACCTTCGAA GCGAGCCCGA AATGGCCGGT TCACCCTTGG GCGACATTAA CAACGCCTTT
CACTTCGACG CAGGTTTATA TGCCAAATTC TTGCGCGGCT ACTCGCAGGC GCGAGGCGTG
GTGCGGACCG AAGGCCGGAT CGTGCAAACC ATGCTGCGGG AATCGGATGG CTTTATCGAA
TCTGTCGTTC TGGCCAGCGG CGAAAAAATA TCCGGGGACT TTTTCATCGA TTGTTCCGGC
ACGCGTGCAC TTCTGATCGG AGATGCACTT AAGTGCGAAT ACGAGGACTG GTCGCATTGG
CTGCCCTGCG ATCGGGCGAT TGCCGTGCCC TGTGAATCCG TGCAGCCACT GGTTCCCTAC
ACGCGCTCCA CGGCCCACTC TGCGGGTTGG CAATGGCGCA TTCCGCTGCA GCACCGCATC
GGCAACGGCC ACGTCTATTC CAGCCGCTTC ATGAGCCAGG ACGAGGCCAC GTCGATTCTG
CTGAACAAGC TGGACGGCAA GCAACTGGCA GAGCCGCGTT ATATCCCCTT CGTTCCGGGG
CGTCGCAAGC AGACCTGGCG CAATAACTGT GTTGCGGTGG GCCTGTCCAG CGGCTTTTTC
GAACCCATCG AGTCCACCAA TATTCATTTG ATCCAGTCCG CTATCGCACG GGTGATCAGG
TTGTTTCCCA ATATGGGATT TCAACAAGCC GATATCGACG AATACAACGC GCAAACACAG
TTCGAGTACG AGCGTATACG CGACTTCATC ATCTTGCACT ACAAGGCGAC GCAACGCGAC
GATTCACCGT TCTGGAATCA CTGCCGGAAC ATGGAAATAC CGGCTACGCT GCAGCACAGG
ATTAGCCTGT TCAGCAGCAA TGGCCGGGTC TACCGCGAGG GGCAGGAGCT GTTTGGCGAC
GTGAGTTGGG TGCAGGTGAT GCACGGTCAA GGTATCCGGC CACAGGGCTA TAACCCTTTG
GTGGATTTGC GCCCCAAGGA CGAAATCAGA GCTTACCTTG GCAATATCGA GGCCGTCATC
AAGAAGTGTG TCGACGTCAT GCCAACGCAC GCAGAATTCA TCGCGAAAAA CTGTGCGGCC
GCCGGGTAA
 
Protein sequence
MSANMLRSIV IVGGGTAGWM TAAALSNVLG DHYHIRLIES DEIATIGVGE ATIPLIKDFN 
LALGIDENEF MRQTQGTYKL GIEFVNWGKI GDSYIHGFGK IGQDLGPIAC YQYWLKMHQA
GEASDLGNYS INTLAPRKSK YLRSEPEMAG SPLGDINNAF HFDAGLYAKF LRGYSQARGV
VRTEGRIVQT MLRESDGFIE SVVLASGEKI SGDFFIDCSG TRALLIGDAL KCEYEDWSHW
LPCDRAIAVP CESVQPLVPY TRSTAHSAGW QWRIPLQHRI GNGHVYSSRF MSQDEATSIL
LNKLDGKQLA EPRYIPFVPG RRKQTWRNNC VAVGLSSGFF EPIESTNIHL IQSAIARVIR
LFPNMGFQQA DIDEYNAQTQ FEYERIRDFI ILHYKATQRD DSPFWNHCRN MEIPATLQHR
ISLFSSNGRV YREGQELFGD VSWVQVMHGQ GIRPQGYNPL VDLRPKDEIR AYLGNIEAVI
KKCVDVMPTH AEFIAKNCAA AG
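
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string holds only the first two blocks and the final block of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Illustrative fragment only: first two blocks and the last block of the gene.
sample = "ATGTCCGCGA ATATGCTCAG\nGCCGGGTAA"
seq = clean_sequence(sample)
print(len(seq), seq[:3], seq[-3:])
print(round(gc_fraction(seq), 2))
```

Running the same two functions on the full gene sequence should give a length of 1509 and a GC fraction that can be compared against the listed GC content.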