Gene TM1040_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3041 
Symbol 
ID4075135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp8058 
End bp11189 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content61% 
IMG OID638004542 
Producthemolysin-type calcium-binding region 
Protein accessionYP_611277 
Protein GI99078019 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCA TCAACCGTCC GGCTGCTGAC GCCGAAGAAT TCTTGCTCCA CTCTGAGGTT 
GAGAATGAGC ACGATTGGCG TGCGCTCCCC GAGACATTCG ATGTTGATGA CTTTGCCCCT
TGGGAAGAGG AGTGGGATCT CGAAGAGGAG GAAGAGGAAG AAGAAGACCC AGAGCACATT
GACAACTCTC AGTTCTTCCA AACCGATGGG TCCGGCAACG AGGGCCTTAT TGACAACCTG
TTGAAACTCA CTGGCAGGGA CAACGCCGCA GCCAATGCCA ATGGCAACGC AAACGGGCTG
TTCAAATTGG TGCTGGGCGG CGATGGTGAC GACAGCCTGA CCGGATCCCG GGACGATGAC
GTCGTGATGG GCGGGCGTGG TCACGACAGC GTCGAGGGCG GCGAGGGCAA CGATACGCTC
ATGGGCGATG AGGACGGCCA GACCGGTTCG GACGCGACGC CGCTCGTGCT CAACATCAAC
AATCTCGTCA GCCAGTCCTA TTACGATGGC GCATCTGCCG GGATCGGAGA TACCGCAGTT
TACCGTGATG TGGCCTTTCT TGAGGATGGG ACCGCAGTCT GGGGACGCCT TGTTCTGGTG
AACGCAAGCG ACCCGAACAT GACCATCGAC CTGTCGGGCG ACCGTGGGGC TGAAATCTCC
ATCACCGGCA ACGGCTATGG GGACACCGCC GAGTTCCGGC TCGAGTTTTT TGATCCCACG
ACCGGCGAGC CGGTCAACAT CAACTCCACC GCGACTTTCA ACGACCTCGA TCAGAACTAC
TACAACTATG ATGTCGAGGC CGTTTCGCTG GATACGGATG AATACACCGC CTTTGGCACC
TCGGATGGTA CCTCGCTCAA TGTAACAAAT CAGGATGGCA GCGTGCGCGC GGCCGGGACC
GAGGCGAACA ACCCGGATGA CCAGGACGCC TGGTTCTCTG CCGAATTCGA AGGCCGGGAT
TTTATTACCT TCACCCTGGA AGCGCGCAGC ACGATCTCCG GGTTCACACT CTCGGGCGAC
CTCATCGAAG ATGGAGTCTA CGTCCCGATC GAGCAAGGCA ACGATACGCT GCTGGGCGGG
CTTGGGTCGG ATGTCATCAT GGGCCAGGGC GGCAATGACT CGCTTGACGG TGGCGACGGC
GACGACAGCA TCGACGGTGG CGAAGGCGAT GACGTCATGT TCGGTGGCAC CGGCTCGGAT
GTGCTTATTG GCGGGCTTGG TGGTGACACG ATCTCTGGCG GCGACGGCGA CGACCTCATC
GATGGTGGCG AGGGCGACGA TTTCCTTTCG ACCGGCCTTG GCAACGATAC GCTGCTCGGT
GGCGAGGGCA ATGATACACT CCACAACTCG GCTGGGGACG ACAGCCTTGT TGGCGGCGTC
GGCGACGACA GCATTGTAGC CACCGAAGGC AACGACACCC TTGAGGGGGG CGACGGCAAT
GATACGCTCT ACGCAGGGGT CGACAACGAC AGCCTCGATG GTGGCCTCGG AAATGACAGC
ATGTTTGGCG AAGCCGGCAG CGACATCATG TTCGGCGGCG GCGGCATGGA CTACATGGAG
GGCGGCATCG GAAATGACAC CATCGATGGT GGCGACGATG ACGACACGAT CCTTGGTGGT
GATGGCGACG ATCTGATCTA CGGCGGCGCG GGCAACGACA GCCTGACCAC CGGTGCGGGC
AATGACACGC TCTTTGGCGG CGAAGGCGAC GATACGCTGC GCAACTCGGT TGGTGATGAC
AGCCTGTCCG GTGGCACCGG CAACGACAGC ATTGTTGCGA CCGATGGCAA TGACACGCTT
GATGGCGGCA GCGGCAATGA CACCATGTAT GGAGGCAACG ACAACGACCT CCTGATCGGT
GGCACCGGTG CGGACCTCAT GTACGGCGAG GCAGATGCCG ATACCTTCCG GATCGAAGAT
AATTTTGGCA ACGACACCAT CGTCGGTGGC GAGGCCGGGA ACGATCAGGA CGTTGTCGAT
CTGTCTTATA TGACCGGCCC GGTCACGGTG ACCTACACCG GCCCCGAAGC CGGAACGATC
ACCGACGGCA CCGACACGAT CCAGTTTTCC GAGATCGAAG AGCTCATCCT CACCGATCAG
GACGACGTCG TTGATGCCCG AAACGATACC GCCGGTGTTC AGATTAATGC GGGATCAGGC
AACGACAGCC TCACTGGCGG CAGTGGGGAC GACGTGTTTA CCGGCGGCGC CGGGGATGAC
CTGATCTCGC TGACTGCCTC GGGCGGGGTC GACCAGATCA CCGACTTTGA CATCAATGAC
ACCGATGTTG ATGGCTTTTA CAACGACCAG CTGGATGTCT CTGGCCTGAC GGGTGGGTCC
GGCGATGGCG GGGCCGTACG CACCATCGAC GTCACCACAA GCGACGATGG CTTTGGCAAT
GCACTTCTGA CGTTCCCGGG TGGCGAACAG CTTGTTTTGC AGGGCGTCAC ACCGGCACAG
ATGACCACCC ACGATCAGCT GTTTGCCGCT GGTATCCCCT GCTACACGCC CGGCAGCCGC
GTGATCACCG CACGGGGCGA GGTCCCGGTG GAAGAGATCC GCGAAGGCGA TCTGGTCCAG
ACAGCTGACA ACGGGCTACA GCCGGTGATC TGGGTCGGCC ACCGCAGGCT CTCGGCTTCG
GATCTGGCGC GACATCCGCA TCTGCGACCT GTCCTGCTGC GCCCCGGCGG AATAATCGCA
AATGATCGCG CCATGCTGGT CTCGCCGCAG CATCGCTTCT TGCTGCCCTC CGATGGCGCG
TCCAGAAGCG AGTCCTTTGT GCGCGCCCGC CTTTTGCTTG AGCTCGACCC TGGTCATGTC
CATGCGGAGG AGCGCGCGTC GGGGGTGTGC TACATCCATT TGATGACCGA GGCGCATCAG
GTGATCTTTG TAGACGGCTG CGCGACAGAG ACCTTCTGGC CCGGCCCCGA GGCCCTGCGC
GGTCTCAGCC CTCGCGACAA ACGCGAGATT TTTGACCTGT TCCCCGAGCT CATTCCTGCG
CTTTCGAACC GGGGTGAGAT CGGGCGCAAG CTCGTGTCAT CGCGCTATGG CGCCTTAGCG
CGTCAGGATC TGCGGCGCAA AGATATTGCC CAGATGGAAC CTTTCCATCC GCGTCAATCG
CGCCGCGCCT GA
 
Protein sequence
MTRINRPAAD AEEFLLHSEV ENEHDWRALP ETFDVDDFAP WEEEWDLEEE EEEEEDPEHI 
DNSQFFQTDG SGNEGLIDNL LKLTGRDNAA ANANGNANGL FKLVLGGDGD DSLTGSRDDD
VVMGGRGHDS VEGGEGNDTL MGDEDGQTGS DATPLVLNIN NLVSQSYYDG ASAGIGDTAV
YRDVAFLEDG TAVWGRLVLV NASDPNMTID LSGDRGAEIS ITGNGYGDTA EFRLEFFDPT
TGEPVNINST ATFNDLDQNY YNYDVEAVSL DTDEYTAFGT SDGTSLNVTN QDGSVRAAGT
EANNPDDQDA WFSAEFEGRD FITFTLEARS TISGFTLSGD LIEDGVYVPI EQGNDTLLGG
LGSDVIMGQG GNDSLDGGDG DDSIDGGEGD DVMFGGTGSD VLIGGLGGDT ISGGDGDDLI
DGGEGDDFLS TGLGNDTLLG GEGNDTLHNS AGDDSLVGGV GDDSIVATEG NDTLEGGDGN
DTLYAGVDND SLDGGLGNDS MFGEAGSDIM FGGGGMDYME GGIGNDTIDG GDDDDTILGG
DGDDLIYGGA GNDSLTTGAG NDTLFGGEGD DTLRNSVGDD SLSGGTGNDS IVATDGNDTL
DGGSGNDTMY GGNDNDLLIG GTGADLMYGE ADADTFRIED NFGNDTIVGG EAGNDQDVVD
LSYMTGPVTV TYTGPEAGTI TDGTDTIQFS EIEELILTDQ DDVVDARNDT AGVQINAGSG
NDSLTGGSGD DVFTGGAGDD LISLTASGGV DQITDFDIND TDVDGFYNDQ LDVSGLTGGS
GDGGAVRTID VTTSDDGFGN ALLTFPGGEQ LVLQGVTPAQ MTTHDQLFAA GIPCYTPGSR
VITARGEVPV EEIREGDLVQ TADNGLQPVI WVGHRRLSAS DLARHPHLRP VLLRPGGIIA
NDRAMLVSPQ HRFLLPSDGA SRSESFVRAR LLLELDPGHV HAEERASGVC YIHLMTEAHQ
VIFVDGCATE TFWPGPEALR GLSPRDKREI FDLFPELIPA LSNRGEIGRK LVSSRYGALA
RQDLRRKDIA QMEPFHPRQS RRA