Gene TM1040_1226 details


Gene Information

Locus tag: TM1040_1226
Symbol:
ID: 4075934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1318425
End bp: 1320509
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 61%
IMG OID: 638006534
Product: ferredoxin
Protein accession: YP_613221
Protein GI: 99081067
COG category: [R] General function prediction only
COG ID: [COG3894] Uncharacterized metal-binding protein
TIGRFAM ID:
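
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1318425
end_bp = 1320509
gene_length_bp = 2085
protein_length_aa = 694


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions, 1320509 - 1318425 + 1 gives 2085 bp, and 2085 / 3 - 1 gives 694 aa, matching the listed lengths.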


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.277788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCGCA TCAACTGCGC AGAATTAAAG GCTAAAGACA TGTCGAACGA TCCCCTCGTC 
GTGTTCACCC CCTCCGGCAA ACGCGGACGT TTTCCAGTTG GCACTCCGGT GCTGACGGCG
GCGCGCCAGC TGGGGGTCGA TCTTGATTCC GTTTGTGGTG GGCGGGGCAT CTGCTCCAAA
TGCCAGATCA CGCCGTCTTA TGGTGAATTC TCGAAACACG GTGTCACCGT GGCCGATGAC
GCCCTGTCGG AATGGAACAA GGTCGAGCAG CGCTATAAGG ACAAGCGCGG TCTGATCGAC
GGGCGCCGTC TGGGCTGTCA GGCGAAGATC GAAAAAGACG TGGTGATCGA CGTCCCGGCC
GAGAGCCAGG TCCACAAGCA AGTGGTTCGC AAGCGGGCCG AGGCGCGCGA CATCGTCATG
AACCCCTCAA TCCGCCTGTA TTATGTCGAG GTAGAAGAAC CGGATATGCA CAAGCCCACC
GGCGACTTGG AGCGTCTGTG TGAGGGGCTT GAGAAGCAGT GGAATATCAA GGGCGTAAAA
GCCGATTTTG AGGTTCTTAG AGAGCTGCAG CCGACCCTGC GCAAAGGCGG CTGGACGGTG
ACGGTCGCGG TGCAACTGGG GACTGAGACC GCGCCTGCGC GCATCATGCA GATCTGGCCG
GGCTACTATG AAGGTACGGT TTACGGGCTG GCGGTGGACC TTGGCTCTAC CACCATTGCG
GCGCATCTGT GCGATCTCAA AACCGGCGAG GTTCTGGCGT CCTCGGGGAT CATGAACCCG
CAGATCCGCT TTGGCGAAGA TCTGATGAGC CGGGTGTCTT ATGCTATGAT GAACAAGGGC
GGTGATCTGG AGATGACCCG CGCCGTGCGT GAAGGGATGA ACGCGCTCTT TTCGCAGATT
TCAGAAGAGG CGCAGATTGA TCAGGGGCTG ATCATGGACG CGGTCTTTGT CTGCAACCCC
GTCATGCATC ACCTGTTTCT CGGAATCGAT CCCTTTGAAC TTGGACAGGC ACCCTTTGCG
CTGGCCACAT CGGACGCCTT GGCGCTGCAA GCGCGCGATC TGGATCTGAA ACTTCACCGT
GCGGCGCGGA TCTATCTGTT GCCCTGTATC GCGGGACACG TCGGCGCGGA TGCAGCGGCC
GTCGCCCTCT CTGAGGCACC GGATAAATCC GATGACCTCG TGCTCGTTGT CGATGTGGGA
ACAAATGCCG AGATCCTATT GGGAAACAAG GATAAAGTGC TGGCGTGCTC TTCGCCAACG
GGCCCTGCCT TTGAGGGGGC GCAGATTTCC TCCGGCCAGC GTGCAGCCCC CGGTGCCATC
GAACGTGTCG AGATCGATCC CGTCACCAAA GAGCCTCGGT TCCGTGTCAT CGGGTCCGAG
ATCTGGTCGG ATCAGGACGG GTTTGAGCAG AGCATCGCAA CCACCGGGAT CACCGGCATC
TGCGGCTCTG GCATCATCGA GGCCATCGCA GAGATGCGGC TGGCTGGTGT ATTGGATGCC
TCGGGTCTGA TCGGATCCGC CGAGCAGACC GGAAGCGCGC GGTGTGTCCC CGAAGGTCGC
ACCAATGCCT ATCTCCTGTG GGACGCAACC GCAGAGGGGG GGCCACGCAT TACAGTCACC
AATCCCGACA TCCGGGCGAT CCAGATGGCC AAGGCGGCGC TCTATTCCGG GGCGCGGCTG
CTGATGGACA AGATCGGCGT GGATCAGGTG GACCGCGTGG TTCTCGCGGG GGCCTTTGGC
GCGCATATCT CGGCCAAACA TGCGATGGTG CTGGGCATGA TCCCCGATTG CCCCCTCGAC
AAGGTGACAA GCGCAGGCAA CGCGGCGGGA ACGGGCGCGC GGATTGCGCT CCTCAACACT
GACGCGCGTC AAGAGATCGA AGATACCGTG CGCAAGATCG AGAAAGTCGA AACCGCTGTC
GAGCCACGTT TCCAGGAGCA TTTTGTCAAT GCTTCTGCAA TTCCGAACTC CGCCGAGCCT
TTCCCGATCC TGCAAAGCGT GGTGAGCCTG CCGGAGGTGA GCTTCAACTC CGCCGGGGCA
GACGCCGCCC GCAGCGGTGG TCGGCGCAGG CGCCGCGGCG GCTGA
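
The gene sequence above is laid out in space-separated blocks of ten bases. A minimal sketch of how such a block could be flattened and its GC fraction computed is shown below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input string is only the first line of the sequence, used as a placeholder for the full text.

```python
def clean_sequence(text):
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence; paste the full block here to check
# the result against the listed GC content.
blocked = "ATGCTTCGCA TCAACTGCGC AGAATTAAAG GCTAAAGACA TGTCGAACGA TCCCCTCGTC"
seq = clean_sequence(blocked)
print(len(seq), round(gc_fraction(seq), 3))
```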
 
Protein sequence
MLRINCAELK AKDMSNDPLV VFTPSGKRGR FPVGTPVLTA ARQLGVDLDS VCGGRGICSK 
CQITPSYGEF SKHGVTVADD ALSEWNKVEQ RYKDKRGLID GRRLGCQAKI EKDVVIDVPA
ESQVHKQVVR KRAEARDIVM NPSIRLYYVE VEEPDMHKPT GDLERLCEGL EKQWNIKGVK
ADFEVLRELQ PTLRKGGWTV TVAVQLGTET APARIMQIWP GYYEGTVYGL AVDLGSTTIA
AHLCDLKTGE VLASSGIMNP QIRFGEDLMS RVSYAMMNKG GDLEMTRAVR EGMNALFSQI
SEEAQIDQGL IMDAVFVCNP VMHHLFLGID PFELGQAPFA LATSDALALQ ARDLDLKLHR
AARIYLLPCI AGHVGADAAA VALSEAPDKS DDLVLVVDVG TNAEILLGNK DKVLACSSPT
GPAFEGAQIS SGQRAAPGAI ERVEIDPVTK EPRFRVIGSE IWSDQDGFEQ SIATTGITGI
CGSGIIEAIA EMRLAGVLDA SGLIGSAEQT GSARCVPEGR TNAYLLWDAT AEGGPRITVT
NPDIRAIQMA KAALYSGARL LMDKIGVDQV DRVVLAGAFG AHISAKHAMV LGMIPDCPLD
KVTSAGNAAG TGARIALLNT DARQEIEDTV RKIEKVETAV EPRFQEHFVN ASAIPNSAEP
FPILQSVVSL PEVSFNSAGA DAARSGGRRR RRGG
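
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds the standard codon table by hand, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted); `translate` is an illustrative helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last bases of the gene sequence listed above.
print(translate("ATGCTTCGCA TCAACTGC"))  # MLRINC
print(translate("GGCGGCTGA"))            # GG
```

The two fragments translate to `MLRINC` and `GG`, which agree with the start and end of the protein sequence listed above.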