Gene TM1040_3767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3767 
Symbol 
ID4074939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp27 
End bp5300 
Gene Length5274 bp 
Protein Length1757 aa 
Translation table11 
GC content61% 
IMG OID638004420 
Producthemolysin-type calcium-binding region 
Protein accessionYP_611162 
Protein GI99077903 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.902425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAA TTCTGACTTT TTCCAGCAAC GATGGCAACG GCATCGGTGC CGATAATTTG 
GCGACCGACG GGGAGGGAGG GAGCAGCGAT ATCGGCGGGA TCACGATCCA GATCGCCAAT
ATCAGTGACG TGCTCGGCAC ACAGATCTCC TCACTCAACT GGTACAATGA CGGCAACTTG
TCATCCGGCG ATGGGTTCTC CGGAATCACG ACAGACTTCG CCTTTGGCGG GGAATGGAAG
GGGATGTCCA TCCGCAGTGC TGGAGGCGAG GAGTTTCAAC TCGAGTCGTT TGACTTCTTT
GACTGGGGCA CCTTCCCGTA CGCGGCATCG ACACTCAATA TCACTGGCTT TCGGGACAAC
GTTCAGGTCG CAACCGATAC CTTCTCGTCG AACAACAATA ACAATCGGGT CAACGTCGAC
ATCTCTGGGA ATACGACCTT CGACGCTGTC GATGAAGTAC GGATCACCTT CGATATCGGC
ACTGGGTATC TATCTCTGAA TAACATCACG ATTGCCAACG CGGTGCCGCC CAACACGGCG
CCATCGCTCG GCGGCACGCC GGCCGACGAC ACGGCAGTTG AAGACGCCGC CGCAGCTATC
GATCTGTCCG CCTACAATAT CTCGGATGCC GAGGGGGATG ACCCGCTCAC GCTGACCCTG
GCCGTTGATC GCGGCACGAT TGCGTCTGTG GATGGCAACG GAACAGTCGA CGGCGTGACC
ATCGCGAGTT CCAGCACCAG CAACATGACT TTGCAGGGCT CGGCCGCTGC GCTCAACACC
TATCTCGACG ACACGAGCCA CATTACCTTT ACGACCGACA GCAACGATAC GAGCGCGGCC
ACACTGACGG TTACCCCCAA CGACGGCACG GCCAATGGCA CCGCGGATAC CGTGACCATT
ACGATCACGC CGGTGAACGA CGACCCTTCC GTCTCTGGTC TGCCTGCTTC GGTGACGGTG
GCCGAAGACA CACTCAGCAA TATTGATCTC TCGGCAGCAA GTATTGGCGA CATCGACAGT
AGCACCATCA CGGTGACACT CTCTGCCAGC GGAGGCAGCT TTGGCACGCC GAGCGACGGT
GCGGGCACCG GCGGTGGCGT GACCGAGACG CTGGTGAATG CGACCACCAT CTCGCTGGTG
GGATCGCCAG CCGATATCAG CACCTATCTC GATACCGCAT CGAACATCCG CTATACACCG
CCTTCGAACG CTGTGGGGGC GGGACAAGCG ACGATCAGCG TCACCGCGAA TGACGGCGAC
GGCTCGGGTG ACGTGGCGCT TGGCACAGTG CAGGTGAATG TGACTGAGGT GAACGATCCG
CCTGTGCTGA CGGGTCTCGA CGCCACGCCG AGCGTGACCG AAGATGGCAT CCCCGTGGTG
CTCGATGCCG ATGTGACCAT CTCCGACGAA GAGCTGGACG CGCTCAATGG CGGTAACGGC
GACTATGCCG GGGCGACACT TGCGATTGCG CGCAATGGTG GCGCAGATAC GTCCGACATC
CTGTCGGTGA TCACCGGCGG CAACCTCACG GTTGCGGGCG GGCCCAACGG GGGCGGTACG
ATCAGCGCGA GCGGCAACGT CATCGCAACC ATCAGCAACA CTGGAAACGG CCAACTTCAG
ATCACCTTCG CCGACAACGG CACCACGCCG ACGACCGCTC TCGTCAACGA ATCCTTGCAG
GCGATCCGCT ACACCAACGG CGCAGACGAT CCCGCCACCT CGGTCCAACT AGACTGGGAT
TTCTCGGACG GGAACGGACA TGCTGCGGGC TCGGTCACCG TGTCGCTGAC CAACGTCAAC
GACGCACCGA CCCTGACGGC GACCGGCGGC AACCCGACCT TTGTCGAGGG CGGCGCGGCC
CAGGATCTCT ACAACACGGT CTCTGCCGAT GCCGTGGAGA TCGGAGATCG CATCACGGGC
CTGACCCTGA CGGTCACCAA TCTCGCCGAT AGTGCATCCG AGATCTTGTC GATCGATGGA
TCCGATGTTG CCCTGACGGA TGGCAATTCC GTGACCACGG CGACCAACAG CCTGTCGGTG
TCCGTGAACG TAAGCGGCAC CACCGCGACG GTGTCGTTCA CCGGGGCGAC GCTGAGCGCG
GCGCAAGCGC AGACCCTCGT CGACGGCCTC ACCTATCGCA ACCTGTCCGA CAATCCCACG
ACCGCCGGCA ACCGGGTGGT CACGATCACC GGCATCACAG ATGATGGTGG AACCGCCAAT
GGCGGTGCGG CAAGCAATGC ACCAACCCTG TCCTCGACGG TCAGCCTCAC GGCGGTCAAT
GATGCGCCGT CGGTGGCCAG CGTCTTTGGC GAGACCAGCC AGATTATTAC GGGCGGCGGC
GCACAGGCGA TCACTGGCCT TGCCAATGCG ACCGTGTCGA ACGCAGATTC GATCGATTAC
AACGGCGGCT TTCTAACCAT CGCCCAGATA AGCGGCGCGG GTAATGGCAG TTGGGGCGTC
GATGGTACGA CGGTCACCGC AGGCGGAGAT GGCACAATTT TAGTAGGCGA GACACTGCAA
GTCAGCGGTG TAACGATCGG GACAATCGAT GCGACAGCAG ATGGACAGAG TGGCCACGAC
CTGACGATCA ACTTCAACGC CGACTCAAGC TCGGCCCGCA TCGAGGCGTT CTTGCAGAAC
CTGCGCTTCG AGGCACCATC CTCAATCGGC GATCGAGGCT ATACGCTCAC ACTCAACGAC
GGGGATGGCA TTGCCAATGG TGGCGATGCG GATGCCAGCG GCAGTATCAC GCTCTCGATC
ACACCCAATC CGCCTGTCCT GGGCAACCTC GATGGCGACA GCGTCAGCGC CACCGAGAAC
GCCGGTGCGG TCTCGCTGGA TATCGGCGCA AACGCAACGG TCACCGACGC CGACTCGCCC
AATTTCAACG GAGGCACCCT GAGGGCCTCG GTGACAAACA ACGCCGATGC AGCCAGCGAC
GTTCTAAGCG TCGCCACCAG CGGTGTCGTG GCCCTCGCGG CCATGACGGC AGGCTCGAAC
GTTTCGGTGT CTGGCACCGT GATCGGCACG CTTGCCAACA ACATCGTGGC TGGCAACGAC
TTCGTCGTGA CCCTGAACGC GGATGCCACG CCGGCGCGGG TGCAGAGCAT CGTTCAGGCA
CTGAGCTTTG AAGCGACAGG CGAGGCGCCG ACAGCTGGCA CACGGACCGT AAGCGTCACC
CTCTCCGACG GTTCCGGATC TGGAAACGCC CATGTCAATG TGGCGGTCAC AGCGCTCAAC
GATGCGCCGC AAATCACGGG CCTGGTGAGC GATGTCTCGT TTACCGAGGA CACCCCGGCC
AATCTCGATC TCTCCGCGCT CACGCTCTCT GATGTCGACA CAACGAGTAG CCTGACCCTG
ACACTAACCG CCAGTGCCGG CACGCTGGCG GCGACAAGTG GCGGCGGTGT GACGGTTGGC
GGGAGCGCAA CAGGCACATT GATCCTCGCG GGCACCATCG TGGATATAGA CACGTTCCTA
AACAGCGCCA CCCATGTTCA TTACAGCCCG GCCGCGAATG CCGCCGGCAA TGATGCAGCC
ACGATCTCAC TGACCATCAA CGATACCGGC ACCAGCACGG ATCTCGGGAC GGTGAATGTG
GACATCACGC CGGTGAATGA CGCGCCGACC AACCCGGCAA CCGCCGATAT TCAGCAAGCC
TTCGACACGC CGCATACCTT CACTGTGGGG GATTTCGGCT TTGCCGACGT GGACGGCGAT
ACGCTTCAAA GCGTGCGGAT CGACACGCTG CCGGGTGCTG GCGTTCTGAG CCTGAACGCC
AGCCCGGTCA GTGCAACAGA CGTCATTACG ATCGGTGACA TCATCGGCGG CAATATCGTG
TTCACGGCCG CCTCCGGCGC ATCGGGCGAT GATTACGCAA TCTTCACCTA CAGCGTGAAC
GATGGCACCA CGTTCTCTGC TGCCCCGGGC ACCATCACGA TCGACGTCGC CGCAGCGCCG
CCTGCAGGCG GCGGTGGCCA GCCGGCCGAG CCGGAACCAG ACTTGGTTGA CGGGGTCCCT
GTTGAGCGGA CCACGACAAC CGAGAACAGC CTCAGTGTCG AGAAGATCGT CATCACACCC
GTCTCGAACA CACGCGAGGA CACGGATGCG GCCACATCGT TGGCCGATAT TCCGCTGCAT
TTCGACAGCG ACGGACAAGC CGTGACCACT GTCAGCGTGC CAACGGGCGT TGGATTCACG
GCACGGGCAA ATGAGACCGC CACGATCCAG ACCAGCTTCC GGGATGGGTT GTACTTGCTG
CGCGATTCCG CGCCAGAATC CGACTGGTTC TCGATGATCA ACGGGCTCGA TCAATGGCTG
AACGGAGTAA GCGCGGGTTG GCTCAACCAG GTCACGCTCA CCACGAACAC AACCTCGGCA
CCCTCTGAGC CGATCCGGAT TACCGGACGG GAAGGCGACA GCACTGAGGT TATGGTGATT
GACGGCTCGC AATTGCCTGT AGGCGCTGTG CTCGATCTCG ACAATATCGA GTTTGCGATC
ATCGTCGGCG ATGTCACCGT GCGCAGCGGT GCTGGCACAA ACGTCGTCTA TGCTGGGAGC
GGCGCTCAGA ACATCGTGCT TGGCGCAGAT GATGACCACC TGAACGGCGG CGATGGAGAT
GACACGATCG GATCGGAAGG CGGTGATGAT CGCCTGTTTG GCGATGCGGG CAACGACACG
CTCTTCGGCG GCCCAGGTGC AAACGTTATG CACGGCGGCT TGGGTATGGA TACCATCATC
TACGCGGCAA ACCGAGATCG CTTCGAGATC AGCATCGAGA AGGGCCAAGT GATCGTGACG
TCGCTCGATG GCCCGAGCCT GCAGGATACG ATCATTAACG CGGAGCTTAT CCGGTTCACG
GATACGGATC TTTCAATCGA TGCGTCCTCG AGCGCCTTCT CAGAAAGCGA TCTGACGATT
GCCACGCTCT TCACGACGGT TCTCGGCCGC CAGGCCGATC TGGCCGGCTA CCAGTTCTGG
ACTAACGTGG CCGACACGAA GCTCGATCTC GCCTCCATAG CGATGTTCAT GCTGCGCTCG
GAGGAAAACA CGCAAACCGA AGGTCAGGCC TTCGATACGC TCAGCCTTGA GGCGCAGATC
GACCGCCTCT ATCAGGAGAT CCTTGGCCGC GCGCCGGATG ATGCGGGCGC CGCGTTCTGG
AACGCGGCTG CCGAGGCCGG CTTTGCCATC GACGACATCG CTGGCGCCTT TGTCGACGCG
CCCGAGTTCG TGGGTCTGGC GCTGCAGCCG ACCGGGCTCG ATTTCTTGAT CTGA
 
Protein sequence
MSTILTFSSN DGNGIGADNL ATDGEGGSSD IGGITIQIAN ISDVLGTQIS SLNWYNDGNL 
SSGDGFSGIT TDFAFGGEWK GMSIRSAGGE EFQLESFDFF DWGTFPYAAS TLNITGFRDN
VQVATDTFSS NNNNNRVNVD ISGNTTFDAV DEVRITFDIG TGYLSLNNIT IANAVPPNTA
PSLGGTPADD TAVEDAAAAI DLSAYNISDA EGDDPLTLTL AVDRGTIASV DGNGTVDGVT
IASSSTSNMT LQGSAAALNT YLDDTSHITF TTDSNDTSAA TLTVTPNDGT ANGTADTVTI
TITPVNDDPS VSGLPASVTV AEDTLSNIDL SAASIGDIDS STITVTLSAS GGSFGTPSDG
AGTGGGVTET LVNATTISLV GSPADISTYL DTASNIRYTP PSNAVGAGQA TISVTANDGD
GSGDVALGTV QVNVTEVNDP PVLTGLDATP SVTEDGIPVV LDADVTISDE ELDALNGGNG
DYAGATLAIA RNGGADTSDI LSVITGGNLT VAGGPNGGGT ISASGNVIAT ISNTGNGQLQ
ITFADNGTTP TTALVNESLQ AIRYTNGADD PATSVQLDWD FSDGNGHAAG SVTVSLTNVN
DAPTLTATGG NPTFVEGGAA QDLYNTVSAD AVEIGDRITG LTLTVTNLAD SASEILSIDG
SDVALTDGNS VTTATNSLSV SVNVSGTTAT VSFTGATLSA AQAQTLVDGL TYRNLSDNPT
TAGNRVVTIT GITDDGGTAN GGAASNAPTL SSTVSLTAVN DAPSVASVFG ETSQIITGGG
AQAITGLANA TVSNADSIDY NGGFLTIAQI SGAGNGSWGV DGTTVTAGGD GTILVGETLQ
VSGVTIGTID ATADGQSGHD LTINFNADSS SARIEAFLQN LRFEAPSSIG DRGYTLTLND
GDGIANGGDA DASGSITLSI TPNPPVLGNL DGDSVSATEN AGAVSLDIGA NATVTDADSP
NFNGGTLRAS VTNNADAASD VLSVATSGVV ALAAMTAGSN VSVSGTVIGT LANNIVAGND
FVVTLNADAT PARVQSIVQA LSFEATGEAP TAGTRTVSVT LSDGSGSGNA HVNVAVTALN
DAPQITGLVS DVSFTEDTPA NLDLSALTLS DVDTTSSLTL TLTASAGTLA ATSGGGVTVG
GSATGTLILA GTIVDIDTFL NSATHVHYSP AANAAGNDAA TISLTINDTG TSTDLGTVNV
DITPVNDAPT NPATADIQQA FDTPHTFTVG DFGFADVDGD TLQSVRIDTL PGAGVLSLNA
SPVSATDVIT IGDIIGGNIV FTAASGASGD DYAIFTYSVN DGTTFSAAPG TITIDVAAAP
PAGGGGQPAE PEPDLVDGVP VERTTTTENS LSVEKIVITP VSNTREDTDA ATSLADIPLH
FDSDGQAVTT VSVPTGVGFT ARANETATIQ TSFRDGLYLL RDSAPESDWF SMINGLDQWL
NGVSAGWLNQ VTLTTNTTSA PSEPIRITGR EGDSTEVMVI DGSQLPVGAV LDLDNIEFAI
IVGDVTVRSG AGTNVVYAGS GAQNIVLGAD DDHLNGGDGD DTIGSEGGDD RLFGDAGNDT
LFGGPGANVM HGGLGMDTII YAANRDRFEI SIEKGQVIVT SLDGPSLQDT IINAELIRFT
DTDLSIDASS SAFSESDLTI ATLFTTVLGR QADLAGYQFW TNVADTKLDL ASIAMFMLRS
EENTQTEGQA FDTLSLEAQI DRLYQEILGR APDDAGAAFW NAAAEAGFAI DDIAGAFVDA
PEFVGLALQP TGLDFLI