Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3041 |
Symbol | |
ID | 4075135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 8058 |
End bp | 11189 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638004542 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_611277 |
Protein GI | 99078019 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGCA TCAACCGTCC GGCTGCTGAC GCCGAAGAAT TCTTGCTCCA CTCTGAGGTT GAGAATGAGC ACGATTGGCG TGCGCTCCCC GAGACATTCG ATGTTGATGA CTTTGCCCCT TGGGAAGAGG AGTGGGATCT CGAAGAGGAG GAAGAGGAAG AAGAAGACCC AGAGCACATT GACAACTCTC AGTTCTTCCA AACCGATGGG TCCGGCAACG AGGGCCTTAT TGACAACCTG TTGAAACTCA CTGGCAGGGA CAACGCCGCA GCCAATGCCA ATGGCAACGC AAACGGGCTG TTCAAATTGG TGCTGGGCGG CGATGGTGAC GACAGCCTGA CCGGATCCCG GGACGATGAC GTCGTGATGG GCGGGCGTGG TCACGACAGC GTCGAGGGCG GCGAGGGCAA CGATACGCTC ATGGGCGATG AGGACGGCCA GACCGGTTCG GACGCGACGC CGCTCGTGCT CAACATCAAC AATCTCGTCA GCCAGTCCTA TTACGATGGC GCATCTGCCG GGATCGGAGA TACCGCAGTT TACCGTGATG TGGCCTTTCT TGAGGATGGG ACCGCAGTCT GGGGACGCCT TGTTCTGGTG AACGCAAGCG ACCCGAACAT GACCATCGAC CTGTCGGGCG ACCGTGGGGC TGAAATCTCC ATCACCGGCA ACGGCTATGG GGACACCGCC GAGTTCCGGC TCGAGTTTTT TGATCCCACG ACCGGCGAGC CGGTCAACAT CAACTCCACC GCGACTTTCA ACGACCTCGA TCAGAACTAC TACAACTATG ATGTCGAGGC CGTTTCGCTG GATACGGATG AATACACCGC CTTTGGCACC TCGGATGGTA CCTCGCTCAA TGTAACAAAT CAGGATGGCA GCGTGCGCGC GGCCGGGACC GAGGCGAACA ACCCGGATGA CCAGGACGCC TGGTTCTCTG CCGAATTCGA AGGCCGGGAT TTTATTACCT TCACCCTGGA AGCGCGCAGC ACGATCTCCG GGTTCACACT CTCGGGCGAC CTCATCGAAG ATGGAGTCTA CGTCCCGATC GAGCAAGGCA ACGATACGCT GCTGGGCGGG CTTGGGTCGG ATGTCATCAT GGGCCAGGGC GGCAATGACT CGCTTGACGG TGGCGACGGC GACGACAGCA TCGACGGTGG CGAAGGCGAT GACGTCATGT TCGGTGGCAC CGGCTCGGAT GTGCTTATTG GCGGGCTTGG TGGTGACACG ATCTCTGGCG GCGACGGCGA CGACCTCATC GATGGTGGCG AGGGCGACGA TTTCCTTTCG ACCGGCCTTG GCAACGATAC GCTGCTCGGT GGCGAGGGCA ATGATACACT CCACAACTCG GCTGGGGACG ACAGCCTTGT TGGCGGCGTC GGCGACGACA GCATTGTAGC CACCGAAGGC AACGACACCC TTGAGGGGGG CGACGGCAAT GATACGCTCT ACGCAGGGGT CGACAACGAC AGCCTCGATG GTGGCCTCGG AAATGACAGC ATGTTTGGCG AAGCCGGCAG CGACATCATG TTCGGCGGCG GCGGCATGGA CTACATGGAG GGCGGCATCG GAAATGACAC CATCGATGGT GGCGACGATG ACGACACGAT CCTTGGTGGT GATGGCGACG ATCTGATCTA CGGCGGCGCG GGCAACGACA GCCTGACCAC CGGTGCGGGC AATGACACGC TCTTTGGCGG CGAAGGCGAC GATACGCTGC GCAACTCGGT TGGTGATGAC AGCCTGTCCG GTGGCACCGG CAACGACAGC ATTGTTGCGA CCGATGGCAA TGACACGCTT GATGGCGGCA GCGGCAATGA CACCATGTAT GGAGGCAACG ACAACGACCT CCTGATCGGT GGCACCGGTG CGGACCTCAT GTACGGCGAG GCAGATGCCG ATACCTTCCG GATCGAAGAT AATTTTGGCA ACGACACCAT CGTCGGTGGC GAGGCCGGGA ACGATCAGGA CGTTGTCGAT CTGTCTTATA TGACCGGCCC GGTCACGGTG ACCTACACCG GCCCCGAAGC CGGAACGATC ACCGACGGCA CCGACACGAT CCAGTTTTCC GAGATCGAAG AGCTCATCCT CACCGATCAG GACGACGTCG TTGATGCCCG AAACGATACC GCCGGTGTTC AGATTAATGC GGGATCAGGC AACGACAGCC TCACTGGCGG CAGTGGGGAC GACGTGTTTA CCGGCGGCGC CGGGGATGAC CTGATCTCGC TGACTGCCTC GGGCGGGGTC GACCAGATCA CCGACTTTGA CATCAATGAC ACCGATGTTG ATGGCTTTTA CAACGACCAG CTGGATGTCT CTGGCCTGAC GGGTGGGTCC GGCGATGGCG GGGCCGTACG CACCATCGAC GTCACCACAA GCGACGATGG CTTTGGCAAT GCACTTCTGA CGTTCCCGGG TGGCGAACAG CTTGTTTTGC AGGGCGTCAC ACCGGCACAG ATGACCACCC ACGATCAGCT GTTTGCCGCT GGTATCCCCT GCTACACGCC CGGCAGCCGC GTGATCACCG CACGGGGCGA GGTCCCGGTG GAAGAGATCC GCGAAGGCGA TCTGGTCCAG ACAGCTGACA ACGGGCTACA GCCGGTGATC TGGGTCGGCC ACCGCAGGCT CTCGGCTTCG GATCTGGCGC GACATCCGCA TCTGCGACCT GTCCTGCTGC GCCCCGGCGG AATAATCGCA AATGATCGCG CCATGCTGGT CTCGCCGCAG CATCGCTTCT TGCTGCCCTC CGATGGCGCG TCCAGAAGCG AGTCCTTTGT GCGCGCCCGC CTTTTGCTTG AGCTCGACCC TGGTCATGTC CATGCGGAGG AGCGCGCGTC GGGGGTGTGC TACATCCATT TGATGACCGA GGCGCATCAG GTGATCTTTG TAGACGGCTG CGCGACAGAG ACCTTCTGGC CCGGCCCCGA GGCCCTGCGC GGTCTCAGCC CTCGCGACAA ACGCGAGATT TTTGACCTGT TCCCCGAGCT CATTCCTGCG CTTTCGAACC GGGGTGAGAT CGGGCGCAAG CTCGTGTCAT CGCGCTATGG CGCCTTAGCG CGTCAGGATC TGCGGCGCAA AGATATTGCC CAGATGGAAC CTTTCCATCC GCGTCAATCG CGCCGCGCCT GA
|
Protein sequence | MTRINRPAAD AEEFLLHSEV ENEHDWRALP ETFDVDDFAP WEEEWDLEEE EEEEEDPEHI DNSQFFQTDG SGNEGLIDNL LKLTGRDNAA ANANGNANGL FKLVLGGDGD DSLTGSRDDD VVMGGRGHDS VEGGEGNDTL MGDEDGQTGS DATPLVLNIN NLVSQSYYDG ASAGIGDTAV YRDVAFLEDG TAVWGRLVLV NASDPNMTID LSGDRGAEIS ITGNGYGDTA EFRLEFFDPT TGEPVNINST ATFNDLDQNY YNYDVEAVSL DTDEYTAFGT SDGTSLNVTN QDGSVRAAGT EANNPDDQDA WFSAEFEGRD FITFTLEARS TISGFTLSGD LIEDGVYVPI EQGNDTLLGG LGSDVIMGQG GNDSLDGGDG DDSIDGGEGD DVMFGGTGSD VLIGGLGGDT ISGGDGDDLI DGGEGDDFLS TGLGNDTLLG GEGNDTLHNS AGDDSLVGGV GDDSIVATEG NDTLEGGDGN DTLYAGVDND SLDGGLGNDS MFGEAGSDIM FGGGGMDYME GGIGNDTIDG GDDDDTILGG DGDDLIYGGA GNDSLTTGAG NDTLFGGEGD DTLRNSVGDD SLSGGTGNDS IVATDGNDTL DGGSGNDTMY GGNDNDLLIG GTGADLMYGE ADADTFRIED NFGNDTIVGG EAGNDQDVVD LSYMTGPVTV TYTGPEAGTI TDGTDTIQFS EIEELILTDQ DDVVDARNDT AGVQINAGSG NDSLTGGSGD DVFTGGAGDD LISLTASGGV DQITDFDIND TDVDGFYNDQ LDVSGLTGGS GDGGAVRTID VTTSDDGFGN ALLTFPGGEQ LVLQGVTPAQ MTTHDQLFAA GIPCYTPGSR VITARGEVPV EEIREGDLVQ TADNGLQPVI WVGHRRLSAS DLARHPHLRP VLLRPGGIIA NDRAMLVSPQ HRFLLPSDGA SRSESFVRAR LLLELDPGHV HAEERASGVC YIHLMTEAHQ VIFVDGCATE TFWPGPEALR GLSPRDKREI FDLFPELIPA LSNRGEIGRK LVSSRYGALA RQDLRRKDIA QMEPFHPRQS RRA
|
| |