Gene TM1040_0568 details

Gene Information

Locus tag: TM1040_0568
Symbol:
ID: 4077919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 604197
End bp: 605432
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 55%
IMG OID: 638005865
Product: hemolysin-type calcium-binding region
Protein accession: YP_612563
Protein GI: 99080409
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
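
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(604197, 605432)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Both results agree with the values listed in this record: 1236 bp and 411 aa.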


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.294597
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000308959
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGATTT TAACTTCAAG CGACTCAATT GACGCGATCT TATTTCCAAG CACCTTCATT 
TCCGGTGGAT ACATTTTCCC AACGCCTGTC TACAGATATG ATGTGACTGA CAGCTACATT
TCCTACACGC TTGACCAAAC GCGCGGCTCG ACCGGCTATT ACAACATAAA CCCAACTGTT
CGGATATTTG GGGACAATCT CAGTGTCGAC TCGTCTGGGC GTGCGTCAGG CACGATCACC
GCGATGGAGT TTCGCTCTTC TAATCGGGGC GAGATCGCAC GCATTGAGCA GATCAGCATC
AATGCATCCG ATTTCACGGA CATCATTTTC GCACGGATCT CCGGAGACAA TACGTCAAGC
ACACAATTCG AGACCCTGCT GGCAGAGGCT TTGGACGTTC TTGAATTTGG CAATCGCAAC
CAAGACATCT CTGACAGAGG CATGTTGCAG TATTTGAGCT TCATCGACCT GCAGGGAGGA
AACGACGAAT TCCATCTCGC GCGCCCCAAA ACTGACGGGA CACGCACAAT TGACGGCGGC
GCGGGTCAGG ACACGCTCCA TCTTGATGAC TTCGGCGTGC CCGACTCGTT TGTCGTGAAC
TTAAAGACAG GCCAGATCAT CACCGACTCA ACTTCGGTGA ATATCACTGG CTTTGAGATC
ATTGACGGCA ACCCCTTCGT CGATCGCTAT ATTGGTTCCA ACAGCGGTGA TCACATCAGA
GCAGCTGGTC GCGCCGATCA GATCAACGGG TTCGGGGGGC GCGACACCCT CTCTGGAGGA
TGGGGGGATG ACAGGATCAC GGGTGGCCGA GGCAAAGACA GACTTCATGG CGACGAAGGC
AACGACTTTC TGCGCGGGGA CGCGGGCGCA GATCTTCTTG TTGGTGGCGC AGGGCGGGAT
CGCCTCGTCG GCCGCGCCGG GCAGGACACG CTCATCGCAG ATGACGGGCG GGACCGCCTC
ATCGGCGGAG CGGGTTCGGA CCTCTTCGTG TTCAATCTCA GCGGCTCTGG AAGCAAGATC
CGCGACTTTG ACATCTCTGA GGGAGATCAC ATCCGACTCG ACACAGACGG GAGTTACGCC
TTTGACACCG ATAGTCTGAA ACTGACGCGC TCTGGCTTTC GGATAAACAC AATCGACTCA
GACTCGAATG TCGAGACGCT TCGGGTCGTG CTGAATGATG ACGCGAGACA TGACCTCAGC
CTCGACGCGC TGTGGGACGT CCTGACATTT GGCTGA
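
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of that cleanup together with a GC percentage calculation; the helper names are hypothetical, and only the first line of the sequence is used as an excerpt.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above, used here as a short excerpt.
excerpt = clean_sequence(
    "ATGGCGATTT TAACTTCAAG CGACTCAATT GACGCGATCT TATTTCCAAG CACCTTCATT"
)
print(len(excerpt))  # 60
print(gc_percent(excerpt))
```

Running `gc_percent` on the full 1236 bp sequence gives a figure that can be compared with the GC content field of the record; the 60-base excerpt alone is not expected to match the whole-gene value.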
 
Protein sequence
MAILTSSDSI DAILFPSTFI SGGYIFPTPV YRYDVTDSYI SYTLDQTRGS TGYYNINPTV 
RIFGDNLSVD SSGRASGTIT AMEFRSSNRG EIARIEQISI NASDFTDIIF ARISGDNTSS
TQFETLLAEA LDVLEFGNRN QDISDRGMLQ YLSFIDLQGG NDEFHLARPK TDGTRTIDGG
AGQDTLHLDD FGVPDSFVVN LKTGQIITDS TSVNITGFEI IDGNPFVDRY IGSNSGDHIR
AAGRADQING FGGRDTLSGG WGDDRITGGR GKDRLHGDEG NDFLRGDAGA DLLVGGAGRD
RLVGRAGQDT LIADDGRDRL IGGAGSDLFV FNLSGSGSKI RDFDISEGDH IRLDTDGSYA
FDTDSLKLTR SGFRINTIDS DSNVETLRVV LNDDARHDLS LDALWDVLTF G
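
The protein sequence can be checked against the gene sequence by translating it. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons, which this sketch does not model. The table is built from the usual TCAG ordering, and `translate` is an illustrative helper rather than a library function.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks unrecognised codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCGATTT TAACTTCAAG CGACTCAATT"))  # MAILTSSDSI
```

The output matches the first ten residues of the protein sequence, and the final codon of the gene, TGA, is a stop codon, which accounts for the 411 residues encoded by 412 codons.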