Gene TM1040_3728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3728 
Symbol 
ID4075435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp786791 
End bp787861 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID638005248 
Productferredoxin 
Protein accessionYP_611957 
Protein GI99078699 
COG category[C] Energy production and conversion 
COG ID[COG0633] Ferredoxin
[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID[TIGR02008] ferredoxin [2Fe-2S]
[TIGR02160] phenylacetate-CoA oxygenase/reductase, PaaK subunit 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.600358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGCT TTCACCCTCT CGAAGTCACC GACATCAAGA AAACCATCCG CGACGCGGTT 
GTGGTCACGC TGAAGCCCCT CAATGGTGCG GCTGCGGAGT TCGACTTTAC CCAAGGCCAA
TATCTGACGT TCCGGCGTGA TTTTGACGGC ACCGAGCTGC GCCGCAGCTA TTCGATCTGT
GCGGGCAAGG ACGAAGGCAT CCTGCAGGTC GGCATCAAGC GCGTGGATGG CGGGGCCTTT
TCCACCTGGG CCAATACGGT CCTCAAGGTC GGTGACACGG TCGAAGCGAT GCCGCCGCAG
GGGCGGTTTT TCACCGATCT CGATGCCGCC GCCGAGAAAC ACTATCTCGG CTTTGCCGGG
GGCTCTGGCA TCACGCCGGT GCTGTCGATC CTGAAGACCA CCCTGCAGGC AGAGCCTCAG
TCTCGTTTCA CGCTGGTCTA CGCCAACAAG GGCATCAACT CGATCATGTT CCGCGAGGAA
ATCGAGGATC TGAAGAACCG CTACATGGGG CGCCTGTCCG TCATTCATGT GCTGGAAACA
GATGCGCAAG AGGTCGATCT CTTCACCGGT CTGGTGACGC AGGAGAAATG CGCCGAGCTG
TTTGAGCGCT GGATTCCGAT CCAGAGCGTC GACACGGCGT TTATCTGTGG ACCCGAACCG
ATGATGCTCG GCATAGCCGA AGCCCTGCGC ACGGCAGGGC TCTCGGATGA GCAGATCAAG
TTCGAGCTCT TCGCGTCCAA TCAGCCGGGG CGGGCTGCAA AGAAGGCCAC AAGTGGTGAT
GCAGCGGCCA GCGCCCCGGT CACGGCGGCG ATCACGCTGG ATGGGGCCAC GCGCTCTGTC
ACCCTGGATC GCAATACCAG CGTTCTGGAA GCCGCTCTGG AAAATGCGAT GGACGCGCCC
TGGTCCTGTC GGGCGGGCGT GTGTTCCACC TGCCGCTGCC GCGTCATCGA GGGCGAGGTC
GAAATGGCCG CGAACCATGC GCTTGAAGAT GACGAAGTGG CCAAAGGCTT CGTGCTCTCC
TGCCAAGCTT ACCCGCTGAG CGACGCCCTT GTGGTGAGCT ACGACGAGTA G
 
Protein sequence
MARFHPLEVT DIKKTIRDAV VVTLKPLNGA AAEFDFTQGQ YLTFRRDFDG TELRRSYSIC 
AGKDEGILQV GIKRVDGGAF STWANTVLKV GDTVEAMPPQ GRFFTDLDAA AEKHYLGFAG
GSGITPVLSI LKTTLQAEPQ SRFTLVYANK GINSIMFREE IEDLKNRYMG RLSVIHVLET
DAQEVDLFTG LVTQEKCAEL FERWIPIQSV DTAFICGPEP MMLGIAEALR TAGLSDEQIK
FELFASNQPG RAAKKATSGD AAASAPVTAA ITLDGATRSV TLDRNTSVLE AALENAMDAP
WSCRAGVCST CRCRVIEGEV EMAANHALED DEVAKGFVLS CQAYPLSDAL VVSYDE